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Fig. 23A 
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Fig. 23B 
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Fig. 25A 
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Fig. 28 
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Fig. 29 
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ARRAYS OF NUCLEIC ACID PROBES ON 
BIOLOGICAL CHIPS 

CROSS-REFERENCE TO RELATED 
APPLICATION 

This is a Continuation of application Ser. No. 08/143,312, 
filed Oct. 26, 1993, now abandoned, which is a continuation 
in part of U.S. patent application Ser. No. 082,937, filed 25 
Jun. 1993, now abandoned, incorporated herein by refer- 
ence. 

Research leading to the invention was funded in part by 
N1H grant No. 1R01HG00813-01 and DOE grant No. 
DE-FG03-92-ER81275, and the government may have cer- 
tain rights to the invention. 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention provides arrays of oligonucleotide 
probes immobilized in microfabricated patterns on silica 
chips for analyzing molecular interactions of biological 
interest. The invention therefore relates to diverse fields 
impacted by the nature of molecular interaction, including 
chemistry, biology, medicine, and medical diagnostics. 

2. Description of Related Art 

Oligonucleotide probes have long been used to detect 
complementary nucleic acid sequences in a nucleic acid of 
interest (the "target" nucleic acid). In some assay formats, 
the oligonucleotide probe is tethered, i.e., by covalent 
attachment, to a solid support, and arrays of oligonucleotide 
probes immobilized on solid supports have been used to 
detect specific nucleic acid sequences in a target nucleic 
acid. See, e.g., PCT patent publication Nos. WO 89/10977 
and 89/11548. Others have proposed the use of large num- 
bers of oligonucleotide probes to provide the complete 
nucleic acid sequence of a target nucleic but failed to 
provide an enabling method for using arrays of immobilized 
probes for this purpose. See U.S. Pat. Nos. 5,202,231 and 
5,002,867 and PCT patent publication No. WO 93/17126. 

The development of VLSI PS™ technology has provided 
methods for making very large arrays of oligonucleotide 
probes in very small arrays. See U.S. Pat. No. 5,143,854 and 
PCT patent publication Nos. WO 90/15070 and 92/10092, 
each of which is incorporated herein by reference. U.S. 
patent application Ser. No. 082,937, filed Jun. 25, 1993, 
describes methods for making arrays of oligonucleotide 
probes that can be used to provide the complete sequence of 
a target nucleic acid and to detect the presence of a nucleic 
acid containing a specific nucleotide sequence. 

Microfabricated arrays of large numbers of oligonucle- 
otide probes, called "DNA chips" offer great promise for a 
wide variety of applications. New methods and reagents are 
required to realize this promise, and the present invention 
helps meet that need. 

SUMMARY OF THE INVENTION 

The present invention provides methods for making high- 
density arrays of oligonucleotide probes on silica chips and 
for using those probe arrays to detect specific nucleic acid 
sequences contained in a target nucleic acid in a sample. The 
invention also provides arrays of oligonucleotide probes on 
DNA chips, in which the probes have specific sequences and 
locations in the array to facilitate identification of a specific 
target nucleic acid. In another aspect, the invention provides 
methods for detecting whether one or more specific 
sequences of a target nucleic acid in a sample varies from a 
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previously characterized sequence or reference sequence. 
The methods of the invention can be used to detect varia- 
tions between a target and reference sequence, including 
single or multiple base substitutions, and deletions and 

5 insertions of bases, as well as detecting the presence, 
location, and sequence of other more complex variations 
between a target and reference sequence in a nucleic acid. 

The present invention provides arrays of oligonucleotide 
probes immobilized on a solid support. The arrays are 

10 preferably synthesized directly on the support using 
VLSIPS™ technology, but other synthesis methods and 
immobilization of pre -synthesized oligonucleotide probes 
can be used to make the oligonucleotide probe arrays, called 
"DNA chips", of the invention. In general, these arrays 
comprise a set of oligonucleotide probes such that, for each 

15 base in a specific reference sequence, the set includes a 
probe (called the "wild-type" or "WT" probe) that is exactly 
complementary to a section of the reference sequence 
including the base of interest and four additional probes 
(called "substitution probes"), which are identical to the WT 

20 probe except that the base of interest has been replaced by 
one of a predetermined set (typically 4) of nucleotides. In the 
preferred embodiment, one of the four substitution probes is 
identical to the wild type probe; the other three are comple- 
mentary to targets that have a single -base substitution at this 

25 position. 

In another aspect, the invention relates to the arrangement 
of individual probes in the array. In one embodiment, the 
probes are arranged on the chip so that probes for a given 
position in the sequence are adjacent, and probes for adja- 

30 cent positions in the reference sequence are also adjacent to 
one another on the chip. One method arranges the probes for 
a single base in a short column (alternately row) and 
arranges the columns in the order of the base position to 
form horizontal (alternately vertical) stripes. The wild-type 
and each of the substitution probes have specified positions 

35 within the column so that all the probes corresponding to an 
A substitution, for example, are in a single row. The stripes 
may be separated on the chip by a blank row or column. 

The DNA chips of the invention can be made in a wide 
number of variations. For some applications, leaving out the 

40 wild-type row, leaving out unimportant bases, pooling bases, 
including insertion and deletion probes, varying the length 
of the probes within a set to make the probes have the same 
or similar Tm relative to the target or to avoid secondary 
structure, varying the mutation position, using multiple 

4 5 probes for a single mutation, providing replicate probes or 
arrays, placing blank "streets" (no probe) between rows, 
columns, or individual probes, and using control probes may 
be appropriate. 

The present invention also provides DNA chips for detect- 

50 ing mutations associated with cystic fibrosis, including 
mutations in exons 4, 7, 9, 10, 11, 20, and 21 of the CFTR 
gene. The invention also provides DNA chips for detecting 
mutations in the p53 gene, a gene in which mutations are 
known to be associated with a wide variety of cancers. Other 

55 DNA chips of the invention provide probe arrays for detect- 
ing specific sequences of mitochondrial DNA, useful for 
identification and forensic purposes. The invention also 
provides DNA chips for detecting specific sequences of 
nucleotides or mutations associated with the acquisition of a 

60 drug resistant phenotype in an infectious organism, such as 
rifampicin or other drug resistant TB strains and HIV, in 
which mutations in an RNA polymerase gene are known to 
give rise to drug resistance. 

65 BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows how the tiling method of the invention 
defines a set of DNA probes relative to a target nucleic acid. 
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In the figure, the target is a DNA molecule, the probes are from the genomic DNA of an individual with wild-type 

single-stranded nucleic acids 16 nucleotides in length, and AF508 sequences; in panel B, the target nucleic acid origi- 

only a portion of the probes defined by the method is shown. nated from a heterozygous (with respect to the AF508 

FIG. 2 shows an illustrative tiled array of the invention mutation) individual, 

with probes for the detection of point mutations. The base at 5 FIG. 8, in sheets 1 and 2, corresponding to panels A and 

the position of substitution in each of the wild-type probes B of FIG. 7, shows graphs of fluorescence intensity versus 

is shown in the wild-type lane, and the shading shows the tiling position. The labels on the horizontal axis show the 

location of the substitution probe having the wild-type bases in the wild-type sequence corresponding to the posi- 

sequence. The SEQ ID. NOS. corresponding to the two tion of substitution in the respective probes. Plotted are the 

peptide sequences shown in the top portion of FIG. 2 are 311 10 intensities observed from the features (or synthesis sites) 

and 312, respectively. The SEQ ID. NOS. corresponding to containing wild-type probes, the features containing the 

the five peptide sequences listed at the bottom of FIG. 2 are substitution probes that bound the most target ("called"), and 

313, 314, 315, 313, and 316, respectively. the feature containing the substitution probes that bound the 

FIG. 3, in panels A, B, and C, shows an image made from tar g et witD toe second highest intensity of all the substitution 

the region of a DNA chip containing CFTR exon 10 probes; 15 probes ("2nd Highest"). The SEQ ID NOS. corresponding to 

in panel A, the chip was hybridized to a wild-type target; in the two peptide sequences shown in sheet 2 of FIG. 8 are 332 

panel C, the chip was hybridized to a mutant AF508 target; and 3l8 » respectively. 

and in panel B, the chip was hybridized to a mixture of the FIG. 9 shows the human mitochondrial genome; "O^" is 

wild-type and mutant targets. The SEQ ID. NOS. corre- the H strand origin of replication, and arrows indicate the 

spending to the four peptide sequences shown in FIG. 3 are 20 cloned unshaded sequence. 

317-320, respectively. FIG. 10 shows the image observed from application of a 

FIG. 4, in sheets 1-3, corresponding to panels A, B, and sample of mitochondrial DNA derived nucleic acid (from 

C of FIG. 3, shows graphs of fluorescence intensity versus the mt4 sample) on a DNA chip. 

tiling position. The labels on the horizontal axis show the ^ FIG. 11 is similar to FIG. 10 but shows the image 

bases in the wild-type sequence corresponding to the posi- " observed from the mt5 sample. 

tion of substitution in the respective probes. Plotted are the F IG. 12 shows the predicted difference image between the 

intensities observed from the features (or synthesis sites) mt4 and mt5 samples on the DNA chip based on mismatches 

containing wild-type probes, the features containing the between the two samples and the reference sequence. 

substitution probes that bound the most target ("called") and 3Q FIG. 13 shows the actual difference image observed for 

the feature containing the substitution probes that bound the tfae mt4 ^ ml5 samples 

target with the second highest intensity of all the substitution * A • , . c 

probes ("2nd Highest"). The SEQ ID. NOS. corresponding . FIG14 ' m sheets 1 and 2 J * ho * s a P lot of normalized 

to the two peptide sequences shown in sheet 1 of FIG. 4 are ulleDS1 f tl f across rows 10 and 11 of the array and a tabula- 

321 and 318, respectively; the SEQ ID. NOS. corresponding „ tl0D of thc mutatl0ns dctectcd - 

to the two peptide sequences shown in sheet 2 of FIG. 4 are FIG - 15 shows the discrimination between wild-type and 

322 and 318, respectively; and the SEQ ID. NOS. corre- mutant h y brids obtained with the chip. A median of the six 
sponding to the two peptide sequences shown in sheet 3 of normalized hybridization scores for each probe was taken; 
FIG. 4 are 323 and 318, respectively thc S^P* 1 P lots tnc ratio of tnc mcdia n score to the normal- 

FIG. 5, in panels A, B, and C, shows an image made from M ize f h y bridization «»™ versus mean counts. A ratio of 1.6 
a region of a DNA chip containing CFTR exon 10 probes; and mcan counts above 50 y ield no false P 0Sltlves - 
in panel A, the chip was hybridized to the wt480 target; in FIG - 16 lllustrates how the identity of thc base mismatch 
panel C, the chip was hybridized to the mu480 target; and in ma y influence *c ^UHy to discriminate mutant and wild- 
panel B, thc chip was hybridized to a mixture of thc l yP e sequences more than the position of the mismatch 
wild-type and mutant targets. The SEQ ID. NOS. corre- 45 within an oligonucleotide probe. The mismatch position is 
sponding to the peptide sequences shown in FIG. 5 are expressed as % of probe length from the 3'-end. The base 
324-327, respectively. change is indicated on the graph. 

FIG. 6, in sheets 1-3, corresponding to panels A, B, and Fia 17 P rovides a 5 ' t0 3' sequence listing of one target 

C of FIG. 5, shows graphs of fluorescence intensity versus corresponding to the probes on the chip. X is a control probe, 

tiling position. The labels on the horizontal axis show the 50 Positions that differ in the tar S et ( ie -> are mismatched with 

bases in the wild-type sequence corresponding to the posi- lhe P robe at the des »g naled site) are in bold. The SEQ ID. 

tion of substitution in the respective probes. Plotted are the N0 ; corresponding to the peptide sequence shown in FIG. 

intensities observed from the features (or synthesis sites) ^ 15 ^* 

containing wild-type probes, the features containing the FIG. 18 shows the fluorescence image produced by scan- 
substitution probes that bound the most target ("called"), and 55 nin S lne cni P described in FIG. 17 when hybridized to a 
the feature containing the substitution probes that bound the sample. 

target with thc second highest intensity of all thc substitution FIG. 19 illustrates the detection of 4 transitions in the 

probes ("2nd Highest"). Thc SEQ ID, NOS. corresponding target sequence relative to the wild-type probes on the chip 

to the two peptide sequences shown in sheet 1 of FIG. 6 are in FIG. 18. 

328 and 329, respectively; the SEQ ID. NOS. corresponding 60 FIG. 20 shows the alignment of some of the probes on a 

to the two peptide sequences shown in sheet 2 of FIG. 6 are p 53 DNA chip with a 12-mer model target nucleic acid. The 

330 and 329, respectively; and the SEQ ID. NOS. corre- SEQ ID. NOS. corresponding to the fourteen peptide 

sponding to the two peptide sequences shown in sheet 3 of sequences shown in FIG. 20 are 334-347, respectively. 

FIG. 6 are 331 and 329, respectively. F IG. 21 shows a set of 10-raer probes for a p53 exon 6 

FIG. 7, in panels A and B, shows an image made from a 65 DNA chip. The SEQ ID. NOS. corresponding to the thirteen 

region of a DNA chip containing CFTR exon 10 probes; in peptide sequences shown in FIG. 21 are 334 and 348-359, 

panel A, the chip was hybridized to nucleic acid derived respectively. 
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FIG. 22 shows that very distinct patterns are observed in the nucleotide sequence of a target nucleic acid with 
after hybridization of p53 DNA chips with targets having oligonucleotide probes of defined length. The length (L) of 
different 1 base substitutions. In the first image in FIG. 22, the probe is typically expressed as the number of nucleotides 
the 12-mer probes that form perfect matches with the or bases in a single-stranded nucleic acid probe. For pur- 
wild-type target are in the first row (top). The 12-mer probes 5 P°ses of the present invention, lengths ranging from 12 to 18 
with single base mismatches are located in the second, third, bases are preferred, although shorter and longer lengths can 
and fourth rows and have much lower signals. also De employed. To employ the tiling method, one syn- 

FIG. 23, in graphs 2, 3, and 4, graphically depicts the data thesizes a ^ of P robes defined b y the Particular nucleotide 

in FIG. 22. On each graph, the X ordinate is the position of sequence of interest in the target nucleic acid. For each base 

the probe in its row on the chip, and the Y ordinate is the 10 in the target DNA segment, one synthesizes a probe comple- 

signal at that probe site after hybridization. mentar y l L ° th L e subsequence of the target nucleic acid begin- 

p,^ , u ruL-j-* • j * * nine at that base and ending L-l bases to the 3'-side (see 

FIG. 24 shows the results of hybridizing mixed target FIG 1} 

populations of WT and mutant p53 genes to the p53 DNA * ... *. 

c y In a preferred embodiment of the invention, the probes are 

FIG. 25, in graphs 1-4, shows (see FIG. 23 as well) the > 5 arran S ed < ei *! er ^ immobilizalion, typically by covalent 

hybridization efficiency of a 10-mer probe array as com- ^chment of a pre-synthesized probe or by synthesis of the 

ared to a 12-mcr probe array probe on the substrate) on the substrate or chips in lanes 

™^ . . / ^r^v T * t . t- , stretching across the chip and separated, and these lanes are 

FIG. 26 shows an image of a P 53 DNA chip hybridized to ^ tumed arranged in blocks Qf preferably 5 , aneS) although 

a ar ^ e . ' , 20 blocks of other sizes will have useful application, as will be 

FIG. 27 illustrates how the actual sequence was read from a p parent from the following illustration. The first of these 

the chip shown in FIG. 26. Gaps in the sequence of letters five lanes> called the -^.^ laDe » contains pro b es 

in the WT rows correspond to control probes or sites. arra d in order of sequence> and all of the probes are 

Positions at which bases are miscalled are represented by com pi eme ntary to a specified wild-type nucleic acid 

letters in italic type in cells corresponding to probes in which 2J ^ other four lanes contain probe ^ts for detect- 

the WT bases have been substituted by other bases. The SEQ . a „ ; ble singl6 . base mutalions in the defined 

ID. NO. corresponding to the peptide sequence shown in in lurilj lhese probe xts are deflned by a p^on 

riu. Li is jou. of potential non-complementarity in the probe relative to the 

FIG. 28 illustrates the VLSIPS™ technology as applied to target (Le>j a single base mismatch) and the identity of the 

the light directed synthesis of oligonucleotides. light (hv) is 3Q nucleotide in the probe at that position (i.e., whether the 

shone through a mask (Mj) to activate functional groups nucleotide is an A, C, G, or T nucleotide). The position of 

(_OH) on a surface by removal of a protecting group (X). mismatch, also called the position of substitution, is prefer- 

Nucleoside building blocks protected with photoremovable ably se i ec ted to be near the center of the probes, i.e., position 

protecting groups (T-X, C-X) are coupled to the activated 7 of a probe of L«15. 

areas. By repeating the irradiation and coupling steps, very 35 For each probe in the wikMype lane> one synthe sizes four 

complex arrays of oligonucleotides can be prepared. probes (oQe fof each of the , anes Qther than the wild . tvpe 

FIG. 29 illustrates how the VLSIPS™ process can be used lanc ) f 7^ of lhcsc four probcs is identical to the corrc- 

to prepare "nucleoside corabinatorials" or oligonucleotides sponding wild-type probe but for the base at the position of 

synthesized by coupling all four nucleosides to form dimers, substitution, and the remaining probe is identical to the 

trimcrs, etc. 40 wild-type probe. This set of four substitution probes is 

FIG. 30 shows the deprotection, coupling, and oxidation preferably placed in a column directly below (or above) the 

steps of a solid phase DNA synthesis method. corresponding wild-type probe, thus creating an A-lane, a 

FIG. 31 shows an illustrative synthesis route for the C-lane, a G-lane, and a T-lane. FIG. 2 shows an illustrative 

nucleoside building blocks used in the VLSIPS™ method. tiled array of the invention with probes for the detection of 

FIG. 32 shows a preferred photoremovable protecting 45 point mutations. The base at the position of substitution in 

group, MeNPOC, and how to prepare the group in active each of the wild-type probes is shown in the wild-type lane, 

form. and the shading shows the location of the substitution probe 

FIG. 33 illustrates an illustrative detection system for having the wild-type sequence. Below are the probes that 

scanning a DNA chip. would be placed in the column marked by the arrow if the 

DETAILED DESCRIPTION OF THE 50 ^ length WCrC 15 thC P ° siti ° n ° f substitution werc 
INVENTION 3-CCGACTGCAGTCGTT (SEQ. ID. NO:l) 
Using the VLSIPS™ method, one can synthesize arrays S'-CCGACTACAGTCGTT (SEQ. ID. NO:2) 
of many thousands of oligonucleotide probes on a substrate, 3'-CCGACTCCAGTCGTT (SEQ. ID. NO:3) 
such as a glass slide or chip. The method can be used, for 55 3'-CCGACTGCAGTCGIT (SEQ. ID. NO:l) 
instance, to synthesize "combinatorial" arrays consisting of, 3'-CCGAClTCAGTCGTr (SEQ. ID. NO:4) 
for example, all possible octanucleotides. Such arrays can be Thus, the substitution lanes occupy four of the five lanes 
used for primary sequencing-by-hybridization on genomic separating successive wild-type lanes on the chip; the blocks 
DNA fragments or other nucleic acids or to detect mutations of five lanes can be separated by a sixth lane for measure- 
in a target nucleic acid for which the normal or "wild-type" eo mcnt of background signals. 

nucleotide sequence is already known. Using the preferred The DNA chips of the invention have a wide variety of 

method of the invention, one employs a strategy called applications. In one embodiment, the DNA chip is used to 

"tiling" to synthesize specific sets of probes or at spatially- select an optimal probe from an array of probes. In this 

defined locations on a substrate, creating the novel probe embodiment, an anay of probes of variable length and 

arrays and "DNA chips" of the invention. 6 5 sequences is synthesized and then hybridized to a target 

To illustrate the tiling method of the invention, consider nucleic acid of known sequence. The pattern of hybridiza- 

the problem of detecting mutalions at one or more position lion reveals the optimal length and sequence composition of 
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probes to detect a particular mutation or other specific base substitution and any deletion within the 192-base exon, 
sequence of nucleotides. In some circumstances, i.e., target including the three-base deletion known as AF508. As 
nucleic acids with repeated sequences or with high G/C described in detail below, hybridization of sub-nanomolar 
content, very long probes may be required for optimal concentrations of wild-type and AF508 oligonucleotide tar- 
detection. In one embodiment for detecting specific 5 S et nucleic acids labeled with fluorescein to these arrays 
sequences in a target nucleic acid with a DNA chip, repeat produces highly specific signals (detected with confocal 
sequences are detected as follows. The chip comprises scanning fluorescence microscopy) that permit discrimina- 
probes of length sufficient to extend into the repeat region tion between mutant and wild-type target sequences in both 
varying distances from each end. The sample, prior to homozygous and heterozygous cases. The method and chips 
hybridization, is treated with a labeled oligonucleotide that 10 of the . inv « ntl0n can also be used to detect other known 
is complementary to a repeat region but shorter than the full mu J? tl0ns m the CFTR & nc > f * detail below, 

length of the repeat. The target nucleic is labeled with a ™* n ? ost COmm0n CySUc fibrosis mutation is known as 
second, distinct label. After hybridization, the chip is AF5 ^ be c ause the mu atloa » a thiw-base delcUonthat 

scanned for probes that have bound both the labeled Urg m * e ? f & T° ^ ^ ^ 

a *u i u i j i* , t -j . / iaucicu WI s cl protem. The present invention provides DNA chips for 
and the labeled oligonucleotide probe; the presence of such 15 5 etecting AF5 g 8) one such chip ? esults ffom ^ (he 

Dound probes snows that at least two repeat sequences are tiling method t0 cxon 10 of the CFTR genCj me 

exon to 

P resent * which AF508 has been mapped. The tiling method involved 

A variety of methods can be used to enhance detection of the synthesis of a set of probes of a selected length in the 
labeled targets bound to a probe on the array. In one range of from 10 to 18 bases and complementary to subse- 
embodiment, the protein MutS (from £. coli) or equivalent 20 quences of the known wild-type CFTR sequence starting at 
proteins such as yeast MSH1, MSH2, and MSH3; mouse a position a few bases into the intron on the 5'-side of exon 
Rep-3, and Streptococcus Hex-A, is used in conjunction 10 and ending a few bases into the intron on the 3'-side. 
with target hybridization to detect probe-target complex that There was a probe for each possible subsequence of the 
contain mismatched base pairs. The protein, labeled directly given segment of the gene, and the probes were organized 
or indirectly, can be added to the chip during or after 25 into a "lane" in such a way that traversing the lane from the 
hybridization of target nucleic acid, and differentially binds upper left-hand corner of the chip to the lower righthand 
to homo- and heteroduplex nucleic acid. A wide variety of corner corresponded to traversing the gene segment base- 
dyes and other labels can be used for similar purposes. For by-base from the 5'-end. The lane containing that set of 
instance, the dye YOYO-1 is known to bind preferentially to probes is, as noted above, called the "wild-type lane." 
nucleic acids containing sequences comprising runs of 3 or 30 Relative to the wild-type lane, a "substitution" lane, called 
more G residues. the "A-lane", was synthesized on the chip. The A-lane 

The DNA chips produced by the methods of the invention probes were identical in sequence to an adjacent 

can be used to study and detect mutations in exons of human (immediately below the corresponding) wild-type probe but 

genes of clinical interest, including point mutations and contained, regardless of the sequence of the wild -type probe, 

deletions. In the following sections, the method of the 35 a dA residue at position 7 (counting from the 3'-end). In 

invention is illustrated by the detection of mutations in a similar fashion, substitution lanes with replacement bases 

variety of clinically and medically significant human nucleic dC, dG, and dT were placed onto the chip in a "C-lanc," a 

acid sequences. Thus, the invention is illustrated first with "G-lanc," and a "T-lanc," respectively. A sixth lane on the 

respect to the preparation of DNA chips for the detection of chip consisted of probes identical to those in the wild-type 

mutations associated with cystic fibrosis, then with DNA 40 lane but for the deletion of the base in position 7 and 

chips for the detection of human mitochondrial DNA restoration of the original probe length by addition to the 

sequences, then with DNA chips for the detection of muta- 5'-end the base complementary to the gene at that position, 
tions in the human p53 gene associated with cancer, and The four substitution lanes enable one to deduce the 

finally with respect to the detection of mutations in the HIV sequence of a target exon 10 nucleic acid from the relative 

RT gene associated with drug resistance. 45 intensities with which the target hybridizes to the probes in 

Detection of Cystic Fibrosis Mutations with DNA Chips the various lanes. The probe organization on the chip can be 

A number of years ago, cystic fibrosis, the most common conveniently columnar, and the set of probes consisting of a 

severe autosomal recessive disorder in humans, was shown wild-type probe and four corresponding substitution probes 

to be associated with mutations in a gene thereafter named is referred to as a "column set." One and only one of the four 

the Cystic Fibrosis Transmembrane Conductance Regulator 50 substitution probes in a column set has exactly the same 

(CFTR) gene. The sequences of the exons and parts of the sequence as the wild-type probe in the set. Those of skill in 

introns in the gene are known, as are the changes corre- the art will appreciate that, in other embodiments of the 

sponding to several hundred known mutations. Several tests invention, one could delete one or more lanes or columns 

have been developed for detecting the most frequent of these and still benefit from the invention. Various versions of such 

mutations. The present invention provides CFTR gene oli- 55 exon 10 DNA chips were made as described above with 

gonucleotide arrays (DNA chips) that can be used to identify probes 15 bases long, as well as chips with probes 10, 14, 

mutations in the CFTR gene rapidly and efficiently. and 18 bases long. For the results described below, the 

The methods used to make the high-density DNA chips of probes were 15 bases long, and the position of substitution 

the invention allow probes for long stretches of DNA coding was 7 from the 3'-end. 

regions to be directly "written" onto the chips in the form of 60 To demonstrate the ability of the chip to distinguish the 

sets of overlapping oligonucleotides. These methods have AF508 mutation from the wild-type, two synthetic target 

been used to develop a number of useful CFTR gene chips, nucleic acids were made. The first, a 39-mer complementary 

one illustrative chip bears an array of 1296 probes covering to a subsequence of exon 10 of the CFTR gene having the 

the full length of exon 10 of the CFTR gene arranged in a three bases involved in the AF508 mutation near its center, 

36x36 array of 356 \m elements. The probes in the array can 65 is called the "wild-type" or wt508 target, corresponds to* 

have any length, preferably in the rang,e of from 10 to 18 positions 111-149 of the exon, and has the sequence shown 

residues and can be used to detect and sequence any single- below: 
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S'-CATTAAAGAAAATArCATCTTTGGTGTTTCCTAT- whose point of substitution corresponds to the T at the 3'-end 

GATGA (SEQ. ID NO: 5). of the deletion was very close to background. Following that 

The second, a 36-mer probe derived from the wild-type pattern, the wild-type probe whose point of substitution 

target by removing those same three bases, is called the corresponds to the middle base (also a T) of the deletion 

"mutant" target or mu508 target and has the sequence shown 5 bound still less target. However, the probe in the T-lane of 

below, first with dashes to indicate the deleted bases, and that column set bound the target very well, 

then without dashes but with one base underlined (to indi- Examination of the sequences of the two targets reveals 

cate the base detected by the T-lane probe, as discussed that the deletion places an A at that position when the 

below): sequences are aligned at their 3'-ends and that the T-lane 

5 ' " C A T..7. AAAGAAAA TATCAT 10 probe is complementary to the mutant target with but two 

TGGTGTTTCCTATGATGA; (SEQ. ID N O: 6) mismatches near an end (shown below in lower-case letters 

S'-CATTAAAGAAAATATCATTGGTGTTTCCTATGATGA. with the position of substitution underlined): 

(SEQ. ID NO: 7) Target: 5'-CATTAAAGAAAATATCATTGGTGT- 

Both targets were labeled with fluorescein at the 5'-end. TTCCTATGATGA 

In three separate experiments, the wild-type target, the is Probe: 3'-TagTAGTAACCACAA (SEQ. ID NO:8) 

mutant target, and an equimolar mixture of both targets was Thus the T-lane probe in that column set calls the correct 

exposed (0.1 nM wt508, 0.1 nM mu508, and 0.1 nM wt508 base from the mutant sequence. Note that, in the graph for 

plus 0.1 nM mu508, respectively, in a solution compatible the equimolar mixture of the two targets, that T-lane probe 

with nucleic acid hybridization) to a CF chip. The hybrid- binds almost as much target as does the A-lane probe in the 

ization mixture was incubated overnight at room 20 same column set, whereas in the other column sets, the 

temperature, and then the chip was scanned on a reader (a probes that do not have wild-type sequence do not bind 

confocal fluorescence microscope in photon-counting mode; target at all as well. Thus, that one column set, and in 

images of the chip were constructed from the photon counts) particular the T-lane probe within that set, detects the AF508 

at several successively higher temperatures while still in mutation under conditions that simulate the homozygous 

contact with the target solution. After each temperature 25 case and also conditions that simulate the heterozygous case, 

change, the chip was allowed to equilibrate for approxi- The present invention thus provides individual probes, 

mately one-half hour before being scanned. After each set of sets of probes, and arrays of probe sets on chips, in specific 

scans, the chip was exposed to denaturing solvent and patterns, as the probes provide important benefits for detect- 

conditions to wash, i.e., remove target that had bound, the ing the presence of specific exon 10 sequences. The 

chip so that the next experiment could be done with a clean 30 sequences of several important probes of the invention are 

cm P* shown below. In each case, the letter "X" stands for the point 

The results of the experiments are shown in FIGS. 3, 4, 5, of substitution in a given column set, so each of the 

and 6. FIG. 3, in panels A, B, and C, shows an image made sequences actually represents four probes, with A, C, G, and 

from the region of a DNA chip containing CFTR exon 10 T, respectively, taking the place of the "X." Sets of shorter 

probes; in panel A, the chip was hybridized to a wild-type 35 probes derived from the sets shown below by removing up 

target; in panel C, the chip was hybridized to a mutant delta to five bases from the 5'-end of each probe and sets of longer 

508 target; and in panel B, the chip was hybridized to a probes made from this set by adding up to three bases from 

mixture of the wild-type and mutant targets. FIG. 4, in sheets the exon 10 sequence to the 5'-end of each probe, arc also 

1-3, corresponding to panels A, B, and C of FIG. 3, shows useful and provided by the invention, 

graphs of fluorescence intensity versus tiling position. The 40 3 , -TTTATAXTAGAAACC (SEQ. ID NO:9) 

labels on the horizontal axis show the bases in the wild-type 3'-TTATAGXAGAAACCA (SEQ. ID NO:10) 

sequence corresponding to the position of substitution in the 3'-TATAGTXGAAACCAC (SEQ. ID NO:ll) 

respective probes. Plotted are the intensities observed from 3-ATAGTAXAAACCACA (SEQ. ID NO:12) 

the features (or synthesis sites) containing wild-type probes, 3'-TAGTAGXAACCACAA (SEQ. ID NO:13) 

the features containing the substitution probes that bound the 45 3'-AGTAGAXACCACAAA (SEQ. ID NO:14) 

most target ("called"), and the feature containing the sub- 3'-GTAGAAXCCACAAAG (SEQ. ID NO:15) 

stitution probes that bound the target with the second highest 3 -TAGAAAXCACAAAGG (SEQ. ID NO:16) 

intensity of all the substitution probes ("2nd Highest"). 3-AGAAACXACAAAGGA (SEQ. ID NO: 17) 

These figures show that, for the wild-type target and the Although in this example the sequence could not be 

equimolar mixture of targets, the substitution probe with a 50 reliably deduced near the ends of the target, where there is 

nucleotide sequence identical to the corresponding wild- not enough overlap between target and probe to allow 

type probe bound the most target, allowing for an unam- effective hybridization, and around the center of the target, 

biguous assignment of target sequence as shown by letters where hybridization was weak for some other reason, per- 

near the points on the curve. The target wt508 thus hybrid- haps high AT-content, the results show the method and the 

ized to the probes in the wild-type lane of the chip, although 55 probes of the invention can be used to detect the mutation of 

the strength of the hybridization varied from probe-to-probe, interest. The mutant target gave a pattern of hybridization 

probably due to differences in melting temperature. The that was very similar to that of the wt508 target at the ends, 

sequence of most of the target can thus be read directly from where the two share a common sequence, and very different 

the chip, by inference from the pattern of hybridization in in the middle, where the deletion is located. As one scans the 

the lanes of substitution probes (if the target hybridizes most 60 image from right to left, the intensity of hybridization of the 

intensely to the probe in the A-lanc, then one infers that the target to the probes in the wild-type lane drops off much 

target has a T in the position of substitution, and so on). more rapidly near the center of the image for mu508 than for 

For the mutant target, the sequence could similarly be wt508; in addition, there is one probe in the T-lane that 

called on the 3'-side of the deletion. However, the intensity hybridizes intensely with mu508 and hardly at all with 

of binding declined precipitously as the point of substitution 65 wt508. The results from the equimolar mixture of the two 

approached the site of the deletion from the 3'-end of the targets, which represents the case one would encounter in 

target, so that the binding intensity on the wild-type probe testing a heterozygous individual for the mutation, are a 
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blend of the results for the separate targets, showing the 
power of the invention to distinguish a wild-type target 
sequence from one containing the AF508 mutation and to 
detect a mixture of the two sequences. 

The results above clearly demonstrate how the DNA chips 5 
of the invention can be used to detect a deletion mutation, 
AF508; another model system was used to show that the 
chips can also be used to detect a point mutation as well. One 
of the more frequent mutations in the CFTR gene is G480C, 
which involves the replacement of the G in position 46 of 10 
exon 10 by a T, resulting in the substitution of a cysteine for 
the glycine normally in position #480 of the CFTR protein. 
The model target sequences included the 21-mer probe 

wt480 to represent the wild-type sequence at positions aam F iw uum a muiviuuai carrying omy me wua-type gene 

37-55 of exon 10: S'-CCTTCAGAGGGTAAAATTAAG 15 and an individual heterozygous for AF508 were amplified by 

(SEQ. ID NO:18) and the 21-mer probe mu480 to represent PCR using exon 10 primers containing the promoter for T7 

5'-CCnCAGAGTGTAAAATTAAG RNA polymerase. Illustrative primers of the invention are 
(bbQ. ID NO:19). shown beIow 



terns. The wild-type sequence could easily be read from the 
chip, but the probe that bound the mu480 target so well when 
only the mu480 target was present also bound it well when 
both the mutant and wild- type targets were present in a 
mixture, making the hybridization pattern easily distinguish- 
able from that of the wild-type target alone. These results 
again show the power of the DNA chips of the invention to 
detect point mutations in both homo- and heterozygous 
individuals. 

To demonstrate clinical application of the DNA chips of 
the invention, the chips were used to study and detect 
mutations in nucleic acids from genomic samples. Genomic 
samples from a individual carrying only the wild-type gene 



Exoq Name Sequence 



10 CH9-T7 TAATACGACTCACTATAGGGAGatgacctafltaatgatgggtU (SEQ. ID NO20) 

10 CFE10C-T7 TAATACGACTCACTATAGGGAGtagtgtgaagggttcatatgc (SEQ ID NO'21) 

10 CFU0C-T3 CrCGGAATTAACCCrCACTAAAGGtagtgtgaagggttcatatg (SEQ ID NC> : 22) 
10, 11 CH10-T7 TAATACGACrCACTATAGGGAGagcatactaaaagtgactctc (SEQ ID NO-23) 

11 CFillc-T7 TAATACGACTCACTATAGGGAGacatgaatgacatttacagcaa (SEQ ID NO-24) 
11 CFillc-T3 CGGAATTAACCCTCACIAAAGGacatgaatgacatttacagcaa (SEQ ID NQ25) 



Jh eX , Pen Mcn ' ' °^ Chip was hvbndized to 30 ™ese primers can be used to amplify exon 10 or exon 11 

each of the ^ sequences; in another embodiment, multiplex PCR is 

scanned with a confocal microscope. FIG. 5, in panels A, B, employed, using two or more pairs of primers to amplify 

and C, shows an image made from the region of a DNA chip more than one exon at a time 

hvhS ?T "Ton 0 > Pr ° beS; iD PaD , eI > thC Chip W3S 7116 P roduct of amplification was then used as a template 
hybridized to the w 480 target; m panel C, the chip was 35 for the RNA polymerase, with fluoresceinated UTP present 
hybridized to the mu480 target; and m panel B, the chip was to label the RNA product. After sufficient RNA was made 
hybridized to a mixture of the wild-type and mutant targets. it was fragmented and applied to an exon 10 DNA chip for* 
FIG. 6, in sheets 1-3, corresponding to panels A, B, and C 15 minutes, after which the chip was washed with hybrid- 
rilino 1; % ™ ^ fi^scerjce intensity versus ization buffer and scanned with the fluorescence micro- 
tiling position The labels on the horizontal axis show the 40 scope. A useful positive control included on many CF exon 
bases in the wild-type sequence corresponding to the posi- 10 chips is the 8-mer 3'-CGCCGCCG-5' FIG 7 in panels 
tion of substitution in the respective probes. Plotted are the Aand B, shows an image made from a region of a DNA chip 
intensities observed from the features (or synthesis sites) containing CFTR exon 10 probes; in panel A, the chip was 
containing wild-type probes the features containing the hybridized to nucleic acid derived from the genomic DNAof 
substitution probes that bound the most target ("called"), and 45 an individual with wild-type AF508 sequences- in panel B 
he feature containing the substitution probes that bound the the target nucleic acid originated from a heterozygous (with 
target with the second highest intensity of all the substitution respect to the AF508 mutation) individual. FIG. 8, in sheets 
probes ( 2nd Highest ) 1 and 2, corresponding to panels A and B of FIG. 7 shows 
TTicsc figures show that the chip could be used to graphs of fluorescence intensity versus tiling position 

JSSTV, 7 - thC CCntCr ° f thC Urget 50 ™ eSC fi S Ures show that the se <l uence of the wild'-type 

^ u discrimination against mismatches is quite RNA can be called for most of the bases near the mutation 
good throughout the sequenced region. When the DNA chip In the case of the AF508 heterozygous carrier, one particular 

Irtin^nTrh, h h^K 111114 ^ 0nly b the Pr ° be ' lhe Same 0ne that ^tinguished so clearly between 

portion of the chip shown bound the target well: the probe the wild-type and mutant oligonucleotide targets in the 

in the set of probes devoted to identifying the base at 55 model system described above, in the T-lane binds a large 

position 46 in exon 10 and that has an A m the position of amount of RNA, while the same probe binds little RNAfrom 

substitution and so is fully complementary to the central the wild-type individual. These results show that the DNA 

P ^K° n u° u mUt i 01 targeL ^ ° ther pr0bes in that re S ion chi t* of the invention are capable of detecting the AF508 

of the chip have at least one mismatch with the mutant target mutation in a heterozygous carrier 

Li™,' 7 b T«nT Ch 1CSS i° f [L 10 SpitC ° f tbal faCt ' thc 60 ^ the P rescnt invcnti0Q P rovid <* methods for synthe- 

scqucncc of mu480 for several positions to both sides of the sizing large numbers of oligonucleotide probes on a glass 

mutation can be read from the chip, albeit with much- substrate and unique probe sets in a defined array in which 

educed intensities from those observed with the wild-type the probes are arranged in the array by the "tiling" method 

ai ££ ' u . of invention. The DNA chips produced by the method 

mi^rf ^lu alS ° H sh0W that ' w u hen L the two taf 8 ets ™* 65 can be used to detect mutations in particular sequences of a 

mrxed together and exposed to the chip, the hybridization target nucleic acid, such as genomic DNA or RNA produced 

pattern observed was a combination of the other two pat- from transcription of an ampliHed genomic DNA These 
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chips can be used to detect both point mutations and small some applications to using a minimal set of oligonucleotides 

deletes. Moreover, the pattern of hybridization to the chip specific to the sequence of interest, rathe 2 a set of al! 

allows inferences to be drawn about the sequences of the nossible N m^T™ „f th~ T V ■ , ! « 

mutant DNAs possioie in -mere, bome of these advantages include: (i) each 

For example, in the model system involving the cystic 5 ? ^ "" y * highIy whet °« « ** 
fibrosis point mutation G480C, the A-lane pfobe whos^ ^'f™™ °* uai .V> DOn speafi C hybridization is mini- 
position of substitution corresponds to the position of the ( ' U) " ' S strai 8 ht£orward t0 correlate hybridization 
mutation does not bind much wild-type target because in the dlfferences wlh sequence differences, particularly with ref- 
wild-type sequence, a G occupies that position. However, it 10 the h y bndlzatioa P a "em of a known standard; and 
binds mutant target very well, allowing one to infer correctly to ^ the . abili . t y 'o address each probe independently during 
that the mutation involves a change of that G to a T. synthesis, using high resolution photolithography, allows the 
Similarly, in the case of the three-base deletion in cystic array 10 be desi 8 ned and optimized for any sequence. For 
fibrosis known as AF508, the T-lane probe that binds mutant 6xa mple the length of any probe can be varied independently 
target so intensely is responding to the fact that the deletion of tne otn ers. 

hasbrought a CAT sequence into the position occupied by 15 The present invention illustrates these advantages bv 

• CIT sequence in the wild-type target. The DNA chips of providing DNA chips and analytical methods for delecting 

the invention can be used to detect and sequence not only specific sequences of human mitochondrial DNA In one 

known mutaUons in an organism^ genome but also new preferred embodiment, the invention provides a DNA Z 

T hnnTon PreVK ? Iy chara f et ! zed ™ 6 ° NA Chips and for "I™™ contained in' 1.3 kb fragment 

methods of the invention can also be used to detect specific 20 human mitochondrial DNA from the "D-loop" redon the 

sequences in other CFTR exons as well as other human most polymorphic region of human mitochondrial DNA 

s^s^~* and chn,cal genetic analysfe ' ^id?S co r ses - a ? of h 269 r r,appin " s £ 

Detertinn nf w.Jfi, u \a * i_ j * i nucleotide probes of varying length in the range of 9~*14 

^^^lmf^ Uma Mlt ° Ch0ndrial DNA - c1 -^ with varying overlaps arranged in -600x600 

Tc nZT k ,u P * . 25 micron fcatures or synthesis sites in an array 1 cmxl cm in 

As noted above, the present invention provides DNA size. The probes on the chip are shown in columnar forr^ 

chips on which a known DNA sequence is represented as an below. An illustrative mitochondrial DNV^T m«™ 

way of overlapping ohgonucleotides on a solid support. tion comprises the following probes (X, Sordini £™ 

This set of oligonucleotides is used to probe a target nucleic shown, followed by the sequence- «DL3" ZrneTs the 

be detected. As also noted above, there are advantages in surface.) P 
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DL3AGTCGGGTATTT (SEQ ID. NO:26) 

DL3GGGTATTTAGTT (SEQ ID. NO:27) 

DUTIXGTrTATCCAA (SEQ ID. NO:2tf) 

DL3ATCCAAACCAGG (SEQ ID. NO:29) 

DL3ACCAGGATCGGA (SEQ ID. NO:30) 

D L3CGTGTGTGTGTGG (SEQ ID. NO:31) 
DL3CXjTGTGTGTGTGGC (SEQ ID. NO:32) 
D L3TCGTGTGTGTGTGG (SEQ ID. NO:33) 

DL3GTAGGATGGGTC (SEQ ID. NO:34) 

DL3AGGATGGGTCGT (SEQ ID. N035) 

DL3GATGGGTCGTGT (SEQ ID. NO:36) 

DL3TGGCGACGATTG (SEQ ID. NO:37) 

D L3GCG ACGATTGGG (SEQ ID. NO:38) 
DUTGGGGGGGA 
DUGAGGGGGCG 

DL3GGAGGGGGCGA (SEQ ID. NO:39) 

DL3GAGGGGGOGA (SEQ ID. NO:40) 

DL3GGCITGGTIXJG (SEQ ID. N041) 

DL3GG'lTGGTrrGGG (SEQ ID. NO-42) 

DL3TGGGGTTTCTAG (SEQ ID. NO:43) 

D L3GTTTCTAGTGGG (SEQ ID. NO:44) 

DL3AGTGGGGGGTGT (SEQ ID. NO:45) 

DL3GGGGTGTCAAAT (SEQ ID. NO:46) 

D L3GTC AAATACATCG (SEQ ID. NO:47) 

DL3ACATCGAATGGAG (SEQ ID. NO.-48) 

DL3CGAATGGAGGAG (SEQ ID. NO:49) 

DL3GAGGAGTTTCGT (SEQ ID. NO:50) 

DL3TTTCGTTATGTGA (SEQ ID. NO:51) 

DL3ATGTGACTTTTAC (SEQ ID. NO:52) 

DL3GAC 1 ' i " I TACAAAT (SEQ ID. NO:53) 

DL3AAATCTGCCCGA (SEQ ID. NO:54) 

DUAATCTGCCCGAG (SEQ ID. NO:5S) 

DL3CCCGAGTGTAGT (SEQ ID, NO:56) 

DL3 AGTCTAGTGGGG (SEQ ID. NO:57) 

D L3 GGG AGGGTG AG (SEQ ID. NO:58) 

DL3GGTGAGGGTATG (SEQ ID. NO:59) 

DUGGTATGATGATTAG (SEQ ID. NO:60) 

DUGATTAGAGTAAGT (SEQ ID. NO:61) 

DL3TTAGAGTAAGTTA (SEQ ID. NO:62) 



9 

10 

11 

12 

13 

14 

15 

16 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 



DL3GGTAGGATGGGT 

DL3GGATGGGTCGTG 

DL3GGTCGTGTGTGT 

D L3GTGTGTGTGGCG 

DL3TGTGGCGACGAT 

DL3GACGATTGGGGT 

DL3ATTGGGGTATGG 

D L3GTATGGG GCTTG 

DL3GGATTGTGGTCG 

DL3TGGTCGGATTGG 

D L3GG ATTGGTCTAAA 

DUTCTAAAGTTTAAA 

DL3GTTTAAAATAGAA 

DL3ATAGAAAAACCG 

DL3AGAAAAACCGC 

DL3AACCGCCATAC 

DL3CCATACGTGAAAA 

DL3ACGTGAAAATTGT 

DL3 AATTGITT AGTG GG 

DL3TGTCAGTGGGGG 

DL3TGGGGTTGA 

DL3GGGTTGATTGTGT 

DL3TTGTGTAATAAAA 

DL3 AATAAAAGGGG A 

DL3TAAAAGGGGAGG 

DL3GTTTTTTAAAGG 

DL3TTTTAAAGGTGG 

DL3AGGTGGTTTGG 

DL3TTGGGGGGGAG 

DL3GGAGGGGGCG 

DL3GGGGCGAAGAC 

DL3GAAGACCGGATG 

DL3CCGGATGTCGTG 

D L3GTCGTG AATTTGT 

D L3CGTG AATTTGTGT 

DUTTGTGTAGAGACG 

DL3TAGAGACGGTTT 

DL3ACGGTTTGGGG 

DL3TGGGGTTTTTGT 

DL3GGGTTTTTGTTT 



(SEQ ID. NO:67) 
(SEQ ID. NO:68) 
(SEQ ID. NO:69) 
(SEQ ID. NO:70) 
(SEQ ID. NO:71) 
(SEQ ID. NO:72) 
(SEQ ID. NO:73) 
(SEQ ID. NO:74) 
(SEQ ID. NO:75) 
(SEQ ID. NO:76) 
(SEQ ID. NO:77) 
(SEQ ID. NO: 78) 
(SEQ ID. NO:79) 
(SEQ ID. NO:80) 
(SEQ ID. NO:81) 
(SEQ ID. NO:82) 
(SEQ ID. NO;83) 
(SEQ ID. NO:84) 
(SEQ ID. NO:85) 
(SEQ ID. NO:86) 
(SEQ ID. NO:87) 
(SEQ ID. NO:8S) 
(SEQ ID. NO;89) 
(SEQ ID. NO:90) 
(SEQ ID. NO:91) 
(SEQ ID. NO:92) 
(SEQ ID. NO:93) 
(SEQ ID. NO:94) 
(SEQ ID. NO:95) 
(SEQ ID. NO:96) 
(SEQ ID. NO:97) 
(SEQ ID. NO:98) 
(SEQ ID. NO:99) 
(SEQ ID. NO: 100) 
(SEQ ID. NO:101) 
(SEQ ID. NO:102) 
(SEQ ID. NO:103) 
(SEQ ID. NO: 104) 
(SEQ ID. NO:105) 
(SEQ ID. NO:106) 
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D L3 AAGTTATGTTGGG 
DL3GTTGGGGGCG 
DL3GGGGGGGGTA 
DL3GCGGGTAGGAT 
DL3ACACAATTAATTAA 
D L3 AATTAATTACG AA 
D L3TACG AACATCCTG 
DL3 ACG AAC ATCCTGT 
D L3TCCTGTATTATTA 
D L3 GTATTATTATTGTT 
D L3 ATTGTTAAA CTTA 
DL3AAACTTACAGACG 
DL3ACAGACGTGTCG 
DL3 GTGTCGGTG AAA 
D L3 GTG AAAGGTGTGT 
DL3GGTGTGTCTGTAG 
DL3TGTGTCTGTAGTA 
DL3GTAGTATTGTTTT 
DL3AGTATTGTTTTTT 
DL3CCTCGTGGGATA 
DL3TGGGATACAGCG 
DL3 G ATAC AGCGTCAT 
DL3GCGTCATAGACAG 
DL3AGACAGAAACTAA 
DL3CAGAAACTAAGGA 
D L3TAAGGACGG AGT 
DL3 G ACGG AGTAGG A 
DL3GTAGGATAATAAA 
DUTAATAAATAGCG 
DL3ATAGCGTAGGAT 
DL3TAGCGTAGGATG 
D L3 AGG ATGC AAGTT 
DL3ATGCAAGTTATAA 
DIJGTTATAATGTCCG 
D I_3 ATGTCCGCTTGT 
DL3TCCGCriTJTATG 
DL3GTGAGITOCCCTC 
D L3TGCCXTCG AG AG 
D L3 CCTCG AG AGGTA 
DL3AGAGGTACGTAA 
DUACGTAAACCATA 
DL3 ACC ATAAAAGCAG 
DL3AAAGCAGACCC 
DL3 AG ACCCCCCAT 
DL3CCCCCATACGT 
DL3CATACGTGCGCT 
DL3GTGCGCTATCAG 
DUGCGCTATCAGTA 
DI 3TCAGTA ACGCTC 
D L3 GTAACGCTCTGC 
DL3AGTCTATCCCCA 
DL3ATCCCCAGGGA 
DL3CAGGGAACTGGT 
DL3ACTGGTGGTAGG 
DL3CTGGTGGTAGGA 
DL3GTAGGAGGCACA 
DL3GGCACATTTAGT 
DL3TTTAGTTATAGGG 
DL3AGGTTTACGGTG 
DL3TACGGTGGGGA 
D L3GTGGGG AGTG G 
DL3GGGAGTGGGTGA 
D 1 3GGGTG ATCCTATG 
DL3CCTATGGTTGTTT 
DL3GGTTGTTTGGATG 
D L3 GTITGG ATGG GT 
D L3 ATGGGTG GG AAT 
DL3GGGAATTGTCATG 
D L3 GTC ATGTATC ATGT 
D L3TCATGTATTTCGG 
DL3TATTTCGGTAAA 
D L3TTCGGTAAATGG 
DL3GTAAATGGCATGT 
DL3 GCATGTAATCGTG 
DUGTAATCGTGTAAT 
DL3GGGAGGGGTAC 
D L3 GGGTACG A ATGT 
DIJACGAATGTTCGTT 
D L3TGTTCGTTC ATGT 
DL3CGTTCATGTCG1T 
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(SEQ ID. 
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DL3TTGTTTCTTGGG 

D L3TCTTGGG ATTGTG 

D L3TGTATG AATG ATTT 

DL3TGATTTCACACAA 

DUCTCTGCGACCTC 

DL3GACCTCGGCCT 

DL3TCGGCCTCGTG 

DL3GATGAAGTCCCAG 

DUAGTCCCAGTATTT 

DL3GTATTTCGGATTT 

D L3TCGG ATTTATCG 

DL3GATTTATCGGGT 

DL3ATCGGGTGTGCA 

D L3TGTGCAAGGGG A 

DL3CAAGGGGAATTT 

DL3GAATTTATTCTGTA 

D L3TCTGTAGTGCTAC 

DL3GTAGTGCTACCT 

DL3GCTACCTAGTAG 

DL3CTAGTAGTCCAGA 

D L3TCCA G ATA9TGGG 

DL3AGATAGTGGGATA 

DL3GGGATAATTGGT 

DL3TAATTGGTGAGTG 

D L3TATA GGGCGTGT 

DL3GGGCGTGTTCTCA 

DL3GTGTTCTCACGAT 

DL3TCACGATGAGAGG 

DL3ATGAGAGGAGCG 

DL3AGGAGCGAGGC 

DUCGAGGCCCGG 

DUGCCCGGGTATT 

D 1 3CGGGTATTGTG A 

DL3GTGAACCCCCAT 

DL3CCCCATCGATTT 

DUATCGArrrcAcrr 

DLJITrCACITGACAT 

D L3TTG ACATAG AGCT 

DL3TAGAGCTGTAGAC 

DL3GTAGACCAAGGA 

DUACCAAGGATGAAG 

DL3CGTGTAATGTCAG 

D L3TGTC AGTTTAGGG 

D L3TC AGTTTAGGG A 

DL3TAGGGAAGAGCA 

D L3 A AG AGC AGGGGT 

DL3CAGGGGTACCTA 

DL3GGTACCTACTGG 

D L3TACTGGGGGG A 

DL3GGGGGAGTCTAT 

DUCATGTATTTTTGG 

DL3TTTTGG GTTAGG 

DL3GGGTTAGGATGT 

DL3GGATGTAGTTTTG 

D L3TGTAGTTTTGGG 

DL3TTTGGGGGAGG 

D L3GGGTTC ATAACTG 

D L3 ATAACTG AGTGGG 

DL3AACTGAGTGGGT 

DL3GTGGGTAGTTGT 

DL3GTAGTTGTTGGC 

D L3GTTGGCG ATACA 

DL3CGATACATAAAAG 

D L3TAAAAGCATGTAA 

DL3GCATGTAATGACG 

DL3 ATG ACGGTCG GT 

D L3GTCGGTGGTACT 

DUGGTACITATAACA 

DL3TCGATTCTAAGAT 

D UTAAG ATTAAATTT 

DL3AAATTTGAATAAG 

DL3AATAAGAGACAAG 

D L3 AAG AG AC AAG AAA 

DL3AAGAAAGTACCC 

DL3AAAGTAOCCCTT 

DL3CCCCTTCGTCTA 

DL3CTTCGTCTAAAC 

DL3CTAAACCCATGG 

DUAACCCATGGTGG 

DL3TGGTGGGTTCAT 



(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID, 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 

(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 



NO:107) 
NO:108) 
NO:109) 
NO:110) 
NO:157) 
NO:158) 
NO: 159) 
NO: 160) 
NO: 161) 
NO:162) 
NO:163) 
NO: 164) 
NO: 165) 
NO:166) 
NO: 167) 
NO: 168) 
NO:l 69) 
NO: 170) 
NO:171) 
NO:172) 
NO: 173) 
NO:174) 
NO: 175) 
NO: 176) 
NO:177) 
NO:178) 
NO: 179) 
NO:180) 
NO:181) 
NO:182) 
NO:183) 
NO: 184) 
NO: 185) 
NO:186) 
NO: 187) 
NO:188) 
NO: 189) 
NO: 190) 
NO: 191) 
NO: 192) 
NO: 193) 
NO:194) 
NO: 195) 
NO: 196) 
NO:197) 
NO:198) 
NO: 199) 
NO:200) 
NO:201) 
NO:202) 
NO:246) 
NO:247) 
NO:248) 
NO:249) 
NO:250) 
NO:25l) 
NO:252) 
NO:253) 
NO:254) 
NO:255) 
NO:256) 
NO:257) 
NO:258) 
NO:259) 
NO:260) 
NO:261) 
NO:262) 
NO:263) 
NO:264) 
NO:265) 
NO:266) 
NO:267) 
NO:268) 
NO:269) 
NO:270) 
NO:27l) 
NO:272) 
NO:273) 
NO:274) 
NO:275) 
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-continued 



10 12 DL3GTCGTTAGTTGG 

11 12 DL3TAGTTGGGAGTT 

12 12 DL3 GGAGTTGATAGTG 

13 12 DL3ATAGTGTGTAGTT 

14 12 DL3GTFTAGTTGACGT 

15 12 DL3TGACGTTGAGGT 

16 12 D L3 CGTTG AGGTTTA 

5 1 3 D L3TATAACATGCCAT 

6 13 D L3.AACATGCC ATGGT 

7 13 DL3CCATGGTATTAT 

8 13 DL3ATTTATGAACTGG 

9 13 D L3 AACTGGTGG A CAT 

10 13 DL3TGGACATCATGTA 



(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 
(SEQ ID. 



NO:233) 5 16 DL3TTGGAAAAAGGT (SEQ ID 

NO:234) 6 16 DL3AAAAGGTTCCTG (SEQ ID 

NO:235) 7 16 DUGGTTCCTGTTTA (SEQ ID 

NO:236) 8 16 DL3CCTGTTTAGTCTC (SEQ ID 

NO:237) 9 16 DL3TTAGTCT L 1 ' 1 1 i ' 1 ' (SEQ ID* 

NO:238) 10 16 DI3CTTTTTCAGAAAT (SEQ ID 

NO:239) 11 16 DL3AGAAATTGAGGTG (SEQ ID 

NO:240) 12 16 DL3AAATTGAGGTGGT (SEQ ID 

NO:241) 13 16 DL3GGTGGTAATCGT (SEQ ED 

NO:242) 14 16 DL3TAATCGTGGGTT (SEQ ID 

NO:243) 15 16 DL3GTGGGTITCGAT (SEQ ID 

NO:244) 16 16 DL3GGTTTCGATTCT (SEQ ID 

NO:245) * * 



NO:276) 
NO:277) 
NO:27S) 
NO:279) 
NO:280) 
NO:28l) 
NO:282) 
NO:283) 
NO:284) 
NO:285) 
NO:286) 
NO;287) 



No probes were present in positions X, Y=0, 12 to X Y=4 
12; X, Y=0, 13 to X, Y-4, 13; X, Y«0, 14 to X Y«4 14- X 
Y-0, 15 to X, Y=4, 15; X, Y=0, 16 to X, Y-4, 16; The length 

01 each of the probes on the chip was variable to minimize 
differences in melting temperature and potential for cross- 
hybndization. Each position in the sequence is represented 
by at least one probe and most positions are represented by 

2 or more probes. As noted above, the amount of overlap 
between the oligonucleotides varies from probe to probe 
FIG. 9 shows the human mitochondrial genome; "0„" is the 
H strand origin of replication, and arrows indicate the cloned 
unshaded sequence. 

DNA was prepared from hair roots of six human donors 
(mtl to mt6) and then amplified by PCR and cloned into 
M13; the resulting clones were sequenced using chain 
terminators to verify that the desired specific sequences were 
present. DNA from the sequenced M13 clones was amplified 
by PCR, transcribed in vitro, and labeled with fluorescein- 
U rp using 13 RNApolymerase. The 1.3 kb RNA transcripts 
were fragmented and hybridized to the chip. The results 
showed that each different individual had DNA that pro- 
duced a unique hybridization fingerprint on the chip and that 
the differences in the observed patterns could be correlated 
with differences in the cloned genomic DNA sequence. The 
results also demonstrated that very long sequences of a 
target nucleic acid can be represented comprehensively as a 
specific set of overlapping oligonucleotides and that arrays 
of such probe sets can be usefully applied to genetic analy- 
sis. J 

The sample nucleic acid was hybridized to the chip in a 
solution composed of 6xSSPE, 0.1% Trilon-X 100 for 60 
minutes at 15° C The chip was then scanned by confocal 
scanning fluorescence microscopy. The individual features 
on the chip were 588x588 microns, but the lower left 5x5 
square features in the array did not contain probes. To 
quantitate the data, pixel counts were measured within each 
synthesis site. Pixels represent 50x50 microns. The fluores- 
cence intensity for each feature was scaled to a mean 
determined from 27 bright features. After scanning, the chip 
was stripped and rehybridized; all six samples were hybrid- 
ized to the same chip. FIG. 10 shows the image observed 
from the mt4 sample on the DNA chip. FIG. 11 shows the 
image observed from the mt5 sample on the DNAchip FIG 
12 shows the predicted difference image between the mt4 
and mt5 samples on the DNA chip based on mismatches 
between the two samples and the reference sequence (sec 
Anderson ct al., 1981, Nature 290: 457-165, incorporated 
herein by reference). FIG. 13 shows the actual difference 
image observed. 

The results show that, in almost all cases, mismatched 
probe/target hybrids resulted in lower fluorescence intensity 
than perfectly matched hybrids. Nonetheless, some probes 
detected mutations (or specific sequences) better than others 



IS . 

and in several cases, the differences were within noise levels 
Improvements can be realized by increasing the amount of 
overlap between probes and hence overall probe density 
and, for duplex DNA targets, using a second set of probes, 
either on the same or a separate chip, corresponding to the 
second strand of the target. FIG. 14, in sheets 1 and 2, shows 
a plot of normalized intensities across rows 10 and 11 of the 
array and a tabulation of the mutations detected. 

FIG. 15 shows the discrimination between wild-type and 
mutant hybrids obtained with this chip. The median of the 
25 six normalized hybridization scores for each probe was 
taken. The graph plots the ratio of the median score to the 
normalized hybridization score versus mean counts. On this 
graph, a ratio of 1.6 and mean counts above 50 yield no false 
positives, and while it is clear that detection of some mutants 
30 can be improved, excellent discrimination is achieved, con- 
sidering the small size of the array. FIG. 16 illustrates how 
the identity of the base mismatch may influence the ability 
to discriminate mutant and wild-type sequences more than 
the position of the mismatch within an oligonucleotide 
35 probe. The mismatch position is expressed as % of probe 
length from the 3'-end. The base change is indicated on the 
graph. These results show that the DNAchip increases the 
capacity of the standard reverse dot blot format by orders of 
magnitude, extending the power of that approach many fold 
and that the methods of the invention are more efficient and 
easier to automate than gel-based methods of nucleic acid 
sequence and mutation analysis. 

These advantages become more apparent as chips with 
more and more probes are employed. To illustrate the 
present invention provides a DNAchip for analyzing human 
45 mitochondrial DNA (mtDNA) that "tiles" through 648 
nucleotides of human H strand mtDNA from positions 
10280 to 356. Hie probes in the array are 15 nucleotides in 
length and each position in the target sequence is repre- 
sented by a set of 4 probes (A, C, G, T substitutions), which 
so differed from one another at position 7 from the 3'-end The 
array consists of 13 blocks of 4x50 probes: each block scans 
through 50 nucleotides of contiguous mtDNA sequence The 
blocks are separated by blank rows. The 4 corner columns 
contain control probes; there are a total of 2600 probes in a 
55 Iff cmx1 - 28 cm square area (feature), and each area is 
256x197 microns. 

Labeled RNA target DNA was prepared by PCR ampli- 
fication of a 1.3 kb region of human mtDNA spannine 
positions 15935 to 667, cloning into M13 (sequence verifi- 
cation was performed), and reamplification of the cloned 
sequences using primers tagged with T3 and T7 RNA 
polymerase promoter sequences and in vitro transcription to 
produce fluorescein-UTP labeled RNA. The RNA was frag- 
mented and hybridized to the oligonucleotide array in a 
solution composed of 6xSSPE, 0.1% Triton X-100 for 60 
65 minutes at 18° C. Unhybridized material was washed away 
with buffer, and the chip was scanned at 25 micron pixel 
resolution. 
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excellent discrimination and some probes did not aD D Tar to nH? , ? ? °<S a «u«d array. Currently available diag- 
hybridize to the target effiS^lK ,o E?T? ; StS for Cancer b ™ a sensitivity of about 

achieved. The target^uence d^ed e pro£ sT " SI'S ^17^ T^' 

six positions: 4 transitions and 2 insertions All 4 transitions fn dZr,;™ ' embodiment provides a method 

were detected, and specific probes couW re^ Uy be ™ he steos of (TJu-^ V* iDV0,VeS 

porated into the array to detect insertions or deletions F?fi Tj? , Ju ( ) mng ? bl0pSy ' which » optionaBy 
19 illustrates the detection ofTtSions in the ,< k C nl sectiooin g <° enrich tumor cells to 

sequence relative to the Zd type P ~ D ,£ ch p 8 15 SS"' Tfi ^V™- ^ ° NA ° r RNAis 
These results illustrate that loLer seo^ences Ln he ? extracted amplified, and analyzed with a DNAchip for 

using the DNA c^ .T^L'TZtZL t ToTT." ,h T^l COt ™ With 

compared to conventional sequencing methods There read Jl< ' f ^ f ™ ° NA cWpS ° f ,he present 

ing length is limited by the relolutionoTgS eWclt o est 20 ^ "12?' 3 ^S"*** ^ by 

Similar results were observed when genomic DNAVmnles '^'u meth ° d t0 pr ™ de an arrav of overlapping 

were prepared from human hair XEJ^tZaS e'xon 6 ^H'^T f ^ * ^ ^ ^ ° f 

signal detection require less than an hour and can be rTaHHv ?• P 8 T ° demonstrale lhe ability to detect 

shortened by appropriate choice of buffer^ TnZrtut * SUbst ! tUt ! on mmat,ons ln the tar get. twelve different single 

probes, and rcagVln princ.^ ££t££%££ 25 ST? TiT ^ ^ *°» m " eM Substi " 

be obtained than by convcntioLl sequenctar w^^^^^^ T each of three positions) were represented on the 

^n^M^ai Jn. let h ITTeS Srom/ S^ - ™*" a .« ba f "» « ^ P-bes' wouW t 
L^raumeni, in which individuals iKSL'SSS ^S^^tS^S^^^ ^ 

Se^ T ° d «' C ^ ° f Pr ° bC addili - a > 

by deletion of the other copy After ^3 L taJSS . r^ '^w'T ^ W3S included for each «»«»*» 
cLmosomal abnormalit esTgin ,0 a P ^ 

the best understood form of cancer CM«rweU ££n^ was represented by every possible over- 
over 50%, perhaps 80%, of all patie^s wnhtlrs have p53 ^1^™' ^ ^ T ° SimpHfy 
mutations. In addition, P 53 mutations have been found fr, a rr, P ? ^P?^ to each va "ed position were 
high proportion of lung, breast, and" he ^^1™ ' « fnl? ^ ^ Ch ' P *" ' he recla "g ular re gions with the 
etal 1990 P/W V 87- 7S^^ 7«o i . 5 followin g slructure: each row of cells represents one 

^^^^^^S^S^A EachS 0 "' Wi * h ,0P r rePreSeDtin S thE ^ ^ 

(l 9 9 2S anDie g oCon 8 ferenc € ),U400m y u ,at"nlS^^^^^^^ ^ t o fT'ST^S be o S ro r Pleme T ary «° ^ ^ 
known. F 6 ° ot lne ,ar 8 e t, with probes complementary to the 

The p53 gene spans 20 kbp in humans and has 11 exon, ™ fh", v a, 1^ °" th ° Md pr ° bcs "'"'plementary to 

10 of which are protein coding (see Tomhal e! T99T wo 7 ° ^ Urge ' °" ^ ^ diS " eBCe betwecn 

Critical Reviews in OncogeJsUl: 2 5 7-282 incomomed h C °'r DS 15 ' Sin8 ' C baSC shift iD ,he Portioning 

herein by reference). The gene prodL a 53 Zdlhon Ik P ' S ', P0SSible ' the series of 10 '^ 

phosphoprotein that regulated DN^Sfcn tSS£° L^r T P ^ f ° Ur ^ im ' nediatel y ""derneath 

acts ,0 halt replication f, the Gl/S bX^K 55 2^ ^ ^ 4 ^ ° f ^ f ° f ' he Same 

doin^E^ -de, ^ f fluoresceins 12-mers 

reproduction of DNA viruses (see Lane 1992 T % P0SSlWe substltutlons ™ the first position of 

15-16, incorporated bcSy Tto^ ^^J? 19 2 were synthesized (see the starred position in the 

tia. interest I the ^"^S^^^JSJ; 60 SpE ^TZ^ 1°^^ DNA 

P 53 mutations. The NCI is currently fundine contracts to ft r^ ,f' 0 25 f Tn f ton X " 100 wcro hybnd.zed to the chip 

characterize the p53 mutation specL Slbv v^, ° ^ temperature for several hours. While target nucleic 

carcinogens. ,n addition, therT aTe research p^jeL Xh SS3"^^ ^'^T^f !!*^'*" 

involve sequencinc o53 from soontanrrmclv ,„v . y g from an argon laser ' and ,he ch P was 

A major resou^in lhe« s.udTes is thfhupe ? ^ 20 au,ofocusin 8 confocal microscope The 

biopsy materia. «J ^ Tp Baffin bToi£. "&> £ " ^JnTet P T" d T PC '° ^ a ° ^ 

projecLs which are aimed at analyzing tnT^l t.onshTn h . 8 ?i By 1 t0 3 h ° UrS ' the si &* 1 

analyzing tne relationship had reached a plateau; to remove the hybridized target and 
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allow hybridization to another target, the chip was stripped For sequencing, the p53 DNA can be cloned from the 
with 60% formamide, 2xSSPE at 17° C for 5 minutes. The sample or directly amplified from genomic DNA by PCR. If 
washing buffer and temperature can vary, but the buffer genomic PCR is used, then the DNA can be diluted prior to 
typically contains 2-to-3xSSPE, 10-to-60% formamide (one amplification so that a single copy of the gene is amplified, 
can use multiple washes, increasing the formamide concen- 5 For diagnostic purposes, the genomic DNA can be isolated 
tration by 10% each wash, and scanning between washes to from a tumor biopsy in which the tumor cells may be the 
determine when the wash is complete), and optionally a majority population. As noted above, the proportion of 
small percentage of Triton X-100, and the temperature is tumor cells in a sample can be enriched by cryostat section- 
typically in the range of 15° to 18° C. ing. DNA can also be isolated and amplified from tumor 

Very distinct patterns were observed after hybridization 10 samples stored in paraffin blocks, 

with targets with 1 base substitutions and visualization with The p53 DNA in the sample can be amplified by PCR 

a confocal microscope and software analysis, as shown in (although other amplification methods can be used) using 

FIG. 22. In general, the probes which form perfect matches 1-4 primer pairs generating amplicons of <3 kbp each, 

with the target retain the highest signal. For example, in the Illustrative primers of the invention for amplifying exon 5 of 
first image in Figure PC, the 12-mer probes that form perfect is the p53 gene are shown below (B is biotin; F is fluorescein) 

matches with the wild-type (WT) target are in the first row 5'-B-CACTTGTGCCCTGACTTTCAAC-3'(SEQ. ID 

(top). The 12-mer probes with single base mismatches are NO: 288) 

located in the second, third, and fourth rows and have much 5 '-F-C ACTTGTG CCCTG ACTTTC AAC-3' 

lower signals. The data is also depicted graphically in FIG. 5'-ATGCAATTAACCCTCACTAAAGGGAGACACTTG- 
23. On each graph, the X ordinate is the position of the probe 20 TGCCCTGACTTTCAAC-3(SEQ. ID NO:289) (has T3 

in its row on the chip, and the Y ordinate is the signal at that promoter) 

probe site after hybridization. 5'-B-GACCCTGGGCAACCAGCCCTGTCGT-3'(SEQ ID 

When a target with a different one base substitution is NO:290) 

hybridized the complementary set of probes has the highest S'-F-GACCCTGGGCAACCAGCCCTGTCGW 

signal (sec pictures 2, 3, and 4 in FIG. 22 and graphs 2, 3, 25 S'-TAATACGACTCACTATAGGGAGGACCCTGGGCA- 

and 4 in FIG. 23). In each case, the probe set with no ACCAGCCCTGTCGT-3'(SEQ. ID NO:291) (has T3 

mismatches with the target has the highest signals. Within a promoter) 

12-mer probe set, the signal was highest at position 6 or 7. After PCR amplification of the target (the amplified target is 
The graphs show that the signal difference between 12-mer called the "amplicon") one strand of the amplicon can then 
probes at the same X ordinate tended to be greatest at 30 be isolated, i.e., using a biotinylated primer that allows 
positions 5 and 8 when the target and the complementary capture of the undesired strand on streptavidin beads, 
probes formed 10 base pairs and 11 base pairs, respectively. Alternatively, asymmetric PCR can be used to generate a 
Because tumors often have both WT and mutant p53 genes, single-stranded target. Another approach involves the gen- 
mixed target populations were also hybridized to the chip, as eration of single stranded RNA form the PCR product by 
shown in FIG. 24, When the hybridization solution consisted 35 incorporating a T7 or other RNA polymerase promoter in 
of a 1:1 mixture of WT 12-mer and a 12-mer with a one of the primers. The single-stranded material can option- 
substitution in position 7 of the target, the sets of probes that ally be fragmented to generate smaller nucleic acids with 
were perfectly matched to both targets showed higher sig- less significant secondary structure than longer nucleic 
nals than the other probe sets. acids. 

The hybridization efficiency of a 10-mer probe array as 40 In one such method, fragmentation is combined with 
compared to a 12-mer probe array was also compared. The labeling. To illustrate, degenerate 8-mers or other degenerate 
10-mer and 12-mer probe arrays gave comparable signals short oligonucleotides are hybridized to the single -stranded 
(see graphs 1^ in FIG. 23 and graphs 1-4 in FIG. 25). target material. In the next step, a DNA polymerase is added 
However, the 10-mer probe sets, which are in rows 5-8 (see with the four different dideoxynucleotides, each labeled with 
images in FIG. 22), seemed to be better in this model system 45 a different fluorophore. Fluorophore-labeled dideoxynucle- 
than the 12-mer probe sets at resolving one target from otide are available from a variety of commercial suppliers 
another, consistent with the expectation that one base mis- such as ABI. Hybridized 8-mers are extended by a labeled 
matches are more destabilizing for 10-mers than 12-mers. dideoxynucleotide. After an optional purification step, i.e., 
Hybridization results within probe sets perfectly matched to with a size exclusion column, the labeled 9-mers are hybrid- 
target also followed the expectation that, the more matches 50 ized to the chip. Other methods of target fragmentation can 
the individual probe formed with the target, the higher the be employed. The single-stranded DNA can be fragmented 
signal. However, duplexes with two 3' dangles (see FIG. 23, by partial degradation with a DNAse or partial depurination 
position 6 m graphs 1-4) have about as much signal as the with acid. Labeling can be accomplished in a separate step 
probes which are matched along their entire length (see FIG. i.e., fluorophore-labeled nucleotides are incorporated before 
23, position 7, in graphs 1^). 55 the fragmentation step or a DNA binding fluorophore, such 

This illustrative model system shows that 12-mer targets as ethidium homodimer, is attached to the target after 

that differ by one base substitutions can be readily distin- fragmentation. 

guished from one another by the novel probe array provided In one embodiment, the DNA chip has an array of 10 4 to 

by the invention and that resolution of the different 12-mer 10 5 probes tiling across the protein coding regions of p53, 

targets was somewhat better with the 10-mer probe sets than 60 which comprise about 1200 bp; smaller arrays specific for 

with the 12-mer probe sets. The value of having several the 600 bp mutational hot spot region are also useful. The 

overlapping probes hybridizing to a target demonstrates the probes overlap for N-2 to N-4 bases, where N is the length 

value of the multiple hybridization events that take place on of the probe in bases. N is typically 10 to 14 bases long, but 

a DNA chip of the invention. The results also demonstrate as will be seen below, probes 15 to 19 bases and longer are 

the feasibility of constructing a probe set to sequence the 65 also useful. Every possible single base substitution occur- 

entire 1.4 kbp protein coding region of p53 or alternatively ring one at a time is represented in the array. The number of 

the 0.6 kbp of exons 5-9 containing mutation hot spots. unique 10-mer probes with 7 base overlaps would be about 
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(1200/3)x4xl0 or about 1.6xl0 4 . To allow 3 replicates of 
each probe, one might have a total array size on the order of 
4.8xl0 4 probes. Of course, arrays of probes within the 
ranges of 10 2 to 10 tf probes are also useful for applications; 
for example, very large arrays of 10 6 or more probes are 5 
useful for sequencing or sequence checking large genomic 
DNA fragments. Optionally fragmented and labeled target 
nucleic acid hybridized to the chip is detected by a confocal 
microscope or other imaging device. The pattern of sites 
"lighting up" with target is preferably analyzed with com- 10 
puter assistance to provide the sequence of the target from 
the pattern of sites producing signals. 

The invention is illustrated below with examples of DNA 
chips comprising very large arrays of DNA probes to "rese- 
quence" p53 target nucleic acid in a sample. To analyze 15 
DNA from exon 5 of the p53 tumor suppressor gene, a set 
of overlapping 17-mer probes was synthesized on a chip. 
The probes for the WT allele were synthesized so as to tile 
across the entire exon with single base overlaps between 
probes. For each WT probe, a sets of 4 additional probes, 20 
one for each possible base substitution at position 7, were 
synthesized and placed in a column relative to the WT probe. 
Exon 5 DNA was amplified by PCR with primers flanking 
the exon. One of the primers was labeled with fluorescein; 
the other primer was labeled with biotin. After amplification, 25 
the biotinylatcd strand was removed by binding to strcpta- 
vidin beads. The fluoresceinated strand was used in hybrid- 
ization. 

About Vi of the amplified, single-stranded nucleic acid 
was hybridized overnight in SxSSPE at 60° C. to the probe 30 
chip (under a cover slip). After washing with 6xSSPE, the 
chip was scanned using confocal microscopy. FIG. 26 shows 
an image of the p53 chip hybridized to the target DNA. 
Analysis of the intensity data showed that 93.5% of the 184 
bases of exon 5 were called in agreement with the WT 35 
sequence (see Buchman et al., 1988, Gene 70: 245-252, 
incorporated herein by reference). The miscalled bases were 
from positions where probe signal intensities were tied 
(1.6%) and where non-WT probes had the highest signal 
intensity (4.9%). FIG. 27 illustrates how the actual sequence 40 
was read. Gaps in the sequence of letters in the WT rows 
correspond to control probes or sites. Positions at which 
bases are miscalled are represented by letters in italic type in 
cells corresponding to probes in which the WT bases have 
been substituted by other bases. 45 

As the diagram indicates, the miscalled bases are from the 
low intensity areas of the image, which may be due to 
secondary structure in the target or probes preventing inter- 
molecular hybridization. To diminish the effects due to 
secondary structure, one can employ shorter targets (i.e., by 50 
target fragmentation) or use more stringent hybridization 
conditions. In addition, the use of a set of probes synthesized 
by tiling across the other strand of a duplex target can also 
provide sequence information buried in secondary structure 
in the other strand. It should be appreciated, however, that 55 
the pattern of low intensity areas that forms as a result of 
secondary structure in the target itself provides a means to 
identify that a specific target sequence is present in a sample. 
Other factors that may contribute to lower signal intensities 
include differences in probe densities and hybridization 60 
stabilities. 

These results demonstrate the advantages provided by the 
DNA chips of the invention to genetic analysis. As another 
example, heterozygous mutations are currently sequenced 
by an arduous process involving cloning and repurification 65 
of DNA. The cloning step is required, because the gel 
sequencing systems are poor at resolving even a 1:1 mixture 
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of DNA. First, the target DNA is amplified by PCR with 
primers allowing easy ligation into a vector, which is taken 
up by transformation of E. coli which in turn must be 
cultured, typically on plates overnight. After growth of the 
bacteria, DNA is purified in a procedure that typically takes 
about 2 hours; then, the sequencing reactions are performed, 
which takes at least another hour, and the samples are run on 
the gel for several hours, the duration depending on the 
length of the fragment to be sequenced. By contrast, the 
present invention provides direct analysis of the PCR ampli- 
fied material after brief transcription and fragmentation 
steps, saving days of time and labor. 

An interesting clinical application for the characterization 
of heterozygous mutations with DNA chips is as follows. 
Individuals with germline cancer mutations have a very high 
risk for secondary tumors after treatment by irradiation. 
About 10% of all cancer patients may have germline muta- 
tions for p53 or other tumor suppressor genes. Thus, before 
deciding on a treatment modality, a physician could use the 
method and DNA chips of the invention to test for a 
germline suppressor gene mutation. 
DNA Chips for Rational Therapeutic Management 

The present invention also provides DNA chips that can 
be used by physicians to determine optimum therapeutic 
protocols by early, rapid detection of biologically mediated 
resistance to a therapeutic agent in a variety of disease states. 
The benefits of such DNA chips are many, as the chips will 
help physicians recognize health care cost savings, achieve 
rapid therapeutic benefits, limit administration of ineffective 
(due to the resistance) yet toxic drugs, monitor changes in 
pathogen resistance, and decrease pathogen acquisition of 
resistance. Important applications include the treatment of 
HIV, other infectious diseases, and cancer. 

HIV has infected a large and expanding number of people, 
resulting in massive health care expenditures. HIV can 
rapidly become resistant to drugs used to treat the infection, 
primarily due to the action of the heterodimeric protein (51 
kD and 66 kD) HIV reverse transcriptase (RT) encoded by 
the 1.7 kb pol gene. The high error rate (5-10 per round) of 
the RT protein is believed to account for the hypermu lability 
of HIV. The nucleoside analogues, i.e., AZT, ddl, ddC, and 
d4T, commonly used to treat HIV infection are converted to 
nucleotide analogues by sequential phosphorylation in the 
cytoplasm of infected cells, where incorporation of the 
analogue into the viral DNA results in termination of viral 
replication, because the 5'-*3' phosphodiester linkage can- 
not be completed. However, within after 6 months to 1 year 
of treatment, HIV typically mutates the RT gene so as to 
become incapable of incorporating the analogue and so 
resistant to treatment. Several known mutations are shown 
in tabular form below. 



RT MUTATIONS ASSOCIATED WITH DRUG RESISTANCE 



ANTI- 
VIRAL CODON aa CHANGE nt CHANGE 



AZT 


67 


Asp 


-> Asn 


GAC 


-> AAC 


AZT 


70 


Lys 


-> Arg 


AAA 


-> AG A 


AZT 


215 


Thr 


-> Phe or Tyr 


ACC 


-> TTC or TAC 


AZT 


219 


Lys 


-> Gin or Glu 


AAA 


-> CAAorGAA 


AZT 


41 


Met 


-> Leu 


ATG 


-> TTG or CTG 


ddl and ddC 


184 


Met 


-> Val 


ATG 


-> GTG 


ddl and ddC 


74 


Leu 


-> Val 






TIBO 82150 


100 


Leu 


-> De 







N,B. other mutations confer resistance to other drugs in vitro 



The present invention provides DNA chips for detecting 
the multiple mutations in the HIV RT gene associated with 
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resistance to different therapeutics. These DNA chips will to gain primary structure information of the DNA target, 

enable physicians to monitor mutations over time and to This format has important applications in sequencing by 

change therapeutics if resistance develops. The DNA chip hybridization, DNA diagnostics and in elucidating the ther- 

will provide redundant confirmation of conserved HIV RT modynamic parameters affecting nucleic acid recognition, 

and other gene sequences, and the probes on the chip will tile 5 Conventional DNA sequencing technology is a laborious 

through, with overlap, in important mutational hot spot procedure requiring electrophoretic size separation of 

regions. The chip will optionally have probes that span the labeled DNA fragments. An alternative approach, termed 

entire coding region of the RT and optionally the genes for Sequencing By Hybridization (SBH), has been proposed 

other HIV proteins, such as coat proteins. HIV target nucleic (Lysov et ah, 1988, Dokl. Akad. NaukSSSR 303: 1508-1511; 

acid can be isolated from blood samples (peripheral blood 10 Bains et al., 1988, /. Theor. Biol 135: 303-307; and 

lymphocytes or PBMC) and amplified by PCR, primers for Drmanac et al., 1989, Genomics 4: 114-128, incorporated 

which are shown in the table below. herein by reference). This method uses a set of short 



AMPLIFICATION OF TARGET 

TARGET 

SIZE PRIMER 1 PRIMER 2 



1, 742bp GTAGAATTCTGTTGACTCAGATTGG GATAAGCTTGGGCCTTATCTATTCCAT 

(SEQ ID. NO:292) (SEQ ID. NO:294) 

535bp AAATCCATACAATACTCCAGTATTTGC ACCCATCCAAAGGAATGGAGGTTCTTTC 

(SEQ ID. NO:293) (SEQ ID. NO:295) 

323bp GcDbank#K02013 1889-1908 bases 2211-2192 



The HIV RT gene chips of the invention, as well as the CF, 
mtDNA, and p53 DNA chips of the invention, illustrate the 
diverse application of the methods and probe arrays of the 
invention. The examples that follow describe methods for 
preparing nucleic acid targets from samples for application 3Q 
to the DNA chips of the invention and provide additional 
details of the methods of the invention. 

EXAMPLES 
I. VLSIPS™ Technology 

As noted above, the VLSIPS™ technology is described in 35 
a number of patent publications and is preferred for making 
the oligonucleotide arrays of the invention. For 
completeness, a brief description of how this technology can 
be used to make and screen DNA chips is provided in this 
Example and the accompanying Figures. In the VLSIPS 40 
method, light is shone through a mask to activate functional 
(for oligonucleotides, typically an — OH) groups protected 
with a photo re movable protecting group on a surface of a 
solid support. After light activation, a nucleoside building 
block, itself protected with a photo re movable protecting 45 
group (at the 5' — OH), is coupled to the activated areas of 
the support. The process can be repeated, using different 
masks or mask orientations and building blocks, to prepare 
very dense arrays of many different oligonucleotide probes. 
The process is illustrated in FIG. 28; FIG. 29 illustrates how 50 
the process can be used to prepare "nucleoside combinato- 
rials" or oligonucleotides synthesized by coupling all four 
nucleosides to form dimers, trimers, etc. 

New methods for the combinatorial chemical synthesis of 
peptide, polycarbamate, and oligonucleotide arrays have 55 
recently been reported (see Fodor et al., 1991, Science 251: 
767-773; Cho et al., 1993, Science 261: 1303-1305; and 
Southern et al., 1992, Genomics 13: 1008-10017, each of 
which is incorporated herein by reference). These arrays, or 
biological chips (see Fodor et al., 1993, Nature 364: 60 
555-556, incorporated herein by reference), harbor specific 
chemical compounds at precise locations in a high -density, 
information rich format, and are a powerful tool for the 
study of biological recognition processes. A particularly 
exciting application of the array technology is in the field of 65 
DNA sequence analysis. The hybridization pattern of a DNA 
target to an array of shorter oligonucleotide probes is used 



oligonucleotide probes of defined sequence to search for 
complementary sequences on a longer target strand of DNA. 
The hybridization pattern is used to reconstruct the target 
DNA sequence. It is envisioned that hybridization analysis 
of large numbers of probes can be used to sequence long 
stretches of DNA. In immediate applications of this hybrid- 
ization methodology, a small number of probes can be used 
to interrogate local DNA sequence. 

The strategy of SBH can be illustrated by the following 
example. A 12-mcr target DNA sequence, 
AGCCTAGCTGAA, (SEQ. ID NO:296) is mixed with a 
complete set of octanucleotide probes. If only perfect 
complementarity is considered, five of the 65,536 octamer 
probes -TCGGATCG, CGGATCGA, GGATCGAC, 
GATCGACT, and ATCGACTT will hybridize to the target. 
Alignment of the overlapping sequences from the hybridiz- 
ing probes reconstructs the complement of the original 
12-mer target: 



TCGGATCG 
CGGATCGA 
GGATCGAC 
GATCGACT 
ATCGACTT 
TCGGATCGAC1T (SEQ. ID NO:297) 



Hybridization methodology can be carried out by attaching 
target DNA to a surface. The target is interrogated with a set 
of oligonucleotide probes, one at a time (see Strezoska et al., 
1991, Proc. Natl Acad. Sci. USA 88: 10089-10093, and 
Drmanac et al., 1993, Science 260: 1649-1652, each of 
which is incorporated herein by reference). This approach 
can be implemented with well established methods of immo- 
bilization and hybridization detection, but involves a large 
number of manipulations. For example, to probe a sequence 
utilizing a full set of octanuclcotides, tens of thousands, of 
hybridization reactions must be performed. Alternatively, 
SDH can be carried out by attaching probes to a surface in 
an array format where the identity of the probes at each site 
is known. The target DNA is then added to the array of 
probes. The hybridization pattern determined in a single 
experiment directly reveals the identity of all complemen- 
tary probes. 
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As noted above, a preferred method of oligonucleotide of the probes will generate detectable signals. Modifying the 

probe array synthesis involves the use of light to direct the above expression for N, one arrives at a relationship esti- 

synthesis of oligonucleotide probes in high-density, minia- mating the number of detectable hybridizations (Nd) for a 

turized arrays. Photolabile 5'-protected N-acyl- DNA target of length Lt and an array of complexity C. 

deoxynucleoside phosphoramidites, surface linker 5 Assuming an average of 5 positions giving signals above 

chemistry, and versatile combinatorial synthesis strategies background: Nd=(l+5(C-l))[Lt-(Lp-l)]. 

have been developed for this technology. Matrices of Arrays of oligonucleotides can be efficiently generated by 

spatially-defined oligonucleotide probes have been light-directed synthesis and can be used to determine the 

generated, and the ability to use these arrays to identify identity of DNA target sequences. Because combinatorial 

complementary sequences has been demonstrated by ao strategies are used, the number of compounds increases 

hybridizing fluorescent labeled oligonucleotides to the DNA exponentially while the number of chemical coupling cycles 

chips produced by the methods. The hybridization pattern increases only linearly. For example, expanding the synthe- 

demonstrates a high degree of base specificity and reveals sis to the complete set of 4 8 (65,536) octanucleotides will 

the sequence of oligonucleotide targets. add only four hours to the synthesis for the 16 additional 

The basic strategy for light-directed oligonucleotide syn- 15 cycles. Furthermore, combinatorial synthesis strategies can 

thesis (1) is outlined in FIG. 28. The surface of a solid be implemented to generate arrays of any desired composi- 

support modified with photolabile protecting groups (X) is tion. For example, because the entire set of dodecamers (4 12 ) 

illuminated through a photolithographic mask, yielding can be produced in 48 photolysis and coupling cycles (b w 

reactive hydroxyl groups in the illuminated regions. A compounds requires bxn cycles), any subset of the dodecam- 

3'-0-phosphoramidite activated deoxynucleoside (protected 20 ers (including any subset of shorter oligonucleotides) can be 

at the 5 '-hydroxyl with a photolabile group) is then presented constructed with the correct lithographic mask design in 48 

to the surface and coupling occurs at sites that were exposed or fewer chemical coupling steps. In addition, the number of 

to light. Following capping, and oxidation, the substrate is compounds in an array is limited only by the density of 

rinsed and the surface illuminated through a second mask, to synthesis sites and the overall array size. Recent experi- 

exposc additional hydroxyl groups for coupling. A second 25 ments have demonstrated hybridization to probes synthe- 

5 f -protectcd, 3'-0-phosphoramiditc activated dcoxynuclco- sized in 25 fim sites. At this resolution, the entire set of 

side is presented to the surface. The selective photodepro- 65,536 octanucleotides can be placed in an array measuring 

tection and coupling cycles are repeated until the desired set 0.64 cm square, and the set of 1,048,576 dodecanucleotides 

of products is obtained. requires only a 2.56 cm array. 

Light directed chemical synthesis lends itself to highly 30 Genome sequencing projects will ultimately be limited by 
efficient synthesis strategies which will generate a maximum DNA sequencing technologies. Current sequencing method- 
number of compounds in a minimum number of chemical ologies are highly reliant on complex procedures and require 
steps. For example, the complete set of 4n polynucleotides substantial manual eflbrt. Sequencing by hybridization has 
(length n), or any subset of this set can be produced in only the potential for transforming many of the manual efforts 
4xn chemical steps. See FIG. 29. The patterns of illumina- 35 into more efficient and automated formats. Light-directed 
tion and the order of chemical reactants ultimately define the synthesis is an efficient means for large scale production of 
products and their locations. Because photolithography is miniaturized arrays for SB H. The oligonucleotide arrays are 
used, the process can be miniaturized to generate high- not limited to primary sequencing applications. Because 
density arrays of oligonucleotide probes. For an example of single base changes cause multiple changes in the hybrid- 
the nomenclature useful for describing such arrays, an array 40 ization pattern, the oligonucleotide arrays provide a power- 
containing all possible octanucleotides of dA and dT is fill means to check the accuracy of previously elucidated 
written as (A+T^ 8 . Expansion of this polynomial reveals the DNA sequence, or to scan for changes within a sequence. In 
identity of all 256 octanucleotide probes from AAAAAAAA the case of octanucleotides, a single base change in the target 
to TTTTTTTT. A DNA array composed of complete sets of DNA results in the loss of eight complements, and generates 
dinucleotides is referred to as having a complexity of 2. The 45 eight new complements. Matching of hybridization patterns 
array given by (A+T+C+G)8 is the full 65,536 octanucle- may be useful in resolving sequencing ambiguities from 
otide array of complexity four. standard gel techniques, or for rapidly detecting DNA muta- 

To carry out hybridization of DNA targets to the probe tional events. The potentially very high information content 

arrays, the arrays arc mounted in a thermostatically con- of light-directed oligonucleotide arrays will change genetic 

trolled hybridization chamber. Fluorescein labeled DNA 50 diagnostic testing. Sequence comparisons of hundreds to 

targets are injected into the chamber and hybridization is thousands of different genes will be assayed simultaneously 

allowed to proceed for Vi to 2 hours. The surface of the instead of the current one, or few at a time format. Custom 

matrix is scanned in an epi fluoresce nee microscope (Zeiss arrays can also be constructed to contain genetic markers for 

Axioscop 20) equipped with photon counting electronics the rapid identification of a wide variety of pathogenic 

using 50-100 /<W of 488 nm excitation from an Argon ion 55 organisms. 

laser (Spectra Physics model 2020). All measurements are Oligonucleotide arrays can also be applied to study the 

acquired with the target solution in contact with the probe sequence specificity of RNA or protein-DNA interactions, 

matrix. Photon counts are stored and image files are pre- Experiments can be designed to elucidate specificity rules of 

sented after conversion to an eight bit image format. See non Watson-Crick oligonucleotide structures or to investi- 

FIG. 33. 60 gate the use of novel synthetic nucleoside analogs for 

When hybridizing a DNA target to an oligonucleotide antiscose or triple helix applications. Suitably protected 

array, N»Lt-(Lp-l) complementary hybrids are expected, RNA monomers may be employed for RNA synthesis. The 

where N is the number of hybrids, Lt is the length of the oligonucleotide arrays should find broad application deduc- 

DNA target, and Lp is the length of the oligonucleotide ing the thermodynamic and kinetic rules governing forma- 

probes on the array. For example, for an 11-mer hybridized 65 tion and stability of oligonucleotide complexes, 

to an octanucleotide array, N=4. Hybridizations with mis- Other than the use of photoremovable protecting groups, 

matches at positions that are 2 to 3 residues from either end the nucleoside coupling chemistry is very similar to that 
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used routinely today for oligonucleotide synthesis. FIG. 30 
shows the deprotection, coupling, and oxidation steps of a 
solid phase DNA synthesis method. FIG. 31 shows an 
illustrative synthesis route for the nucleoside building blocks 
used in the method. FIG. 32 shows a preferred photoremov- 
able protecting group, MeNPOC, and how to prepare the 
group in active form. The procedures described below show 
how to prepare these reagents. The nucleoside building 
blocks are 5'-MeNPOC-THYMIDINE-3'-OCEP; 
5'-MeNPOC-N 4 -t-BUTYL PHENOXYACETYL- 
DEOXYCYnDINE-3-OCEP; 5'-MeNPOC-N 4 -t-BUTYL 
PHENOXYACETYL-DEOXYGUANOSINE-3'-OCEP; 
and 5*-MeNPOC-N 4 -t-BUTYL PHENOXYACETYL- 
DEOXYADENOSINE-3'-OCEP. 

A. Preparation of 4, 5-methylenedioxy-2-nitroacetophenone 
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minimum volume of CH 2 C1 2 or THF(~175 ml) and then 
precipitating it by slowly adding hexane (1000 ml) while 
stirring (yield 51 g; 80% overall). It can also be recrystal- 
lized (eg., toluene-hexane), but this reduces the yield. 
C. Preparation of l-(4,5- methyIenedioxy-2-nitrophenyl) 
ethyl chloroformate (MeNPOC-Cl) 



no 2 



V.0 



coa 2 v 

Toluene/THF*' 
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A solution of 50 g (0.305 mole) 3,4- 
methylenedioxyacetophenone (Aldrich) in 200 mL glacial 
acetic acid was added dropwise over 30 minutes to 700 mL 
of cold (2-4° C.) 70% HN0 3 with stirring (NOTE: the 
reaction will overheat without external cooling from an ice 
bath, which can be dangerous and lead to side products). At 
temperatures below 0° C, however, the reaction can be 
sluggish. A temperature of 3°-5° C. seems to be optimal). 
The mixture was left stirring for another 60 minutes at 3°-5° 
C, and then allowed to approach ambient temperature. 
Analysis by TLC (25% EtOAc in hexane) indicated com- 
plete conversion of the starting material within 1-2 hr. When 
the reaction was complete, the mixture was poured into -3 
liters of crushed ice, and the resulting yellow solid was 
filtered off, washed with water and then suction -dried. Yield 
-53 g (84%), used without further purification, 
B. Preparation of l-(4,5-Methylenedioxy-2-nitrophenyl) 
ethanol 
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Phosgene (500 mLof 20% w/v in toluene from Fluka: 965 
mmole; 4 eq.) was added slowly to a cold, stirring solution 
of 50 g (237 mmole; 1 eq.) of l-(4,5-methylenedioxy-2- 
nitrophenyl)ethanol in 400 mL dry THF. The solution was 
stined overnight at ambient temperature at which point TLC 
30 (20% Et 2 0/hexane) indicated >95% conversion. The mix- 
ture was evaporated (an oil-less pump with downstream 
aqueous NaOH trap is recommended to remove the excess 
phosgene) to afford a viscous brown oil. Purification was 
effected by flash chromatography on a short (9x13 cm) 
35 column of silica gel eluted with 20% Et 2 0/hexane. Typically 
55 g (85%) of the solid yellow McNPOC-Cl is obtained by 
this procedure. The crude material has also been rccrystal- 
lized in 2-3 crops from 1:1 ether/hexane. On this scale, -100 
ml is used for the first crop, with a few percent THF added 
40 to aid dissolution, and then cooling overnight at -20° C. (this 
procedure has not been optimized). The product should be 
stored dessicated at -20° C. 

1). Synthesis of 5'-MeNPOC-2'-DEOXYNUCLEOSIDE-3'- 
(N,N-DIlSOPROPYL 2- CYAN O ETHYL PHOSPHORA- 
45 MIDITES 

(1) 5'-MeNPOC-Nucleosides 



V- o 



HO 
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Sodium borohydride (10 g; 0.27 mol) was added slowly 
to a cold, stirring suspension of 53 g (0.25 mol) of 4,5- 
methylenedioxy-2-nitroacetophenone in 400 mL methanol. 
The temperature was kept below 10° C. by slow addition of 55 
the NaBH 4 and external cooling with an ice bath. Stirring 
was continued at ambient temperature for another two hours, 
at which time TLC (CHjClJ indicated complete conversion 
of the ketone. The mixture was poured into one liter of 
ice-water and the resulting suspension was neutralized with 60 
ammonium chloride and then extracted three times with 400 
mL CH 2 C1 2 or EtOAc (the product can be collected by 
filtration and washed at this point, but it is somewhat soluble 
in water and this results in a yield of only -60%). The 
combined organic extracts were washed with brine, then 65 
dried with MgS0 4 and evaporated. The crude product was 
purified from the main byproduct by dissolving it in a 




Base MenpocCl ^ 
Pyridine ^ 



McnpocO' 




Base 



Base»TH YMIDINE (T); N-4-ISOBUTYRYL 
2'-DEOXYCYTIDINE (ibu-dC); N-2-PHENOXYACETYL 
2'DEOXYGUANOSINE (PAC-dG); and N-6- 
PHENOXYACETYL 2'DEOXYADENOSINE (PAC-dA) 

All four of the 5'-MeNPOC nucleosides were prepared 
from the base-protected 2'-deoxynucIeosides by the follow- 
ing procedure. The protected 2'-deoxynucleoside (90 
mmole) was dried by co-evaporating twice with 250 mL 
anhydrous pyridine. The nucleoside was then dissolved in 
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300 mL anhydrous pyridine (or 1:1 pyridine/DMF, for the 
dG^ 0 nucleoside) under argon and cooled to -2° C. in an 
ice bath. A solution of 24.6 g (90 mmole) MeNPOC-Cl in 
100 mL dry THP was then added with stirring over 30 
minutes. The ice bath was removed, and the solution allowed 
to stir overnight at room temperature (TLC: 5-10% MeOH 
in CH 2 C1 2 ; two diastereomers). After evaporating the sol- 
vents under vacuum, the crude material was taken up in 250 
mL ethyl acetate and extracted with saturated aqueous 
NaHC0 3 and brine. The organic phase was then dried over 
Na 2 S0 4 , filtered and evaporated to obtain a yellow foam. 
The crude products were finally purified by flash chroma- 
tography (9x30 cm silica gel column eluted with a stepped 
gradient of 2%-6% MeOH in CH^IJ. Yields of the puri- 
fied diastereomeric mixtures are in the range of 65-75%. 

(2) 5'-MeNPOC-2'-DEOXYNUCLEOSIDE-3'-(N,N- 
DIISOPROPYL 2-CYANOETHYL 
PHOSPHORAMIDITES) 



10 



McnpocO" 




Base 



"Am id king reagent" 
DIEA/DCM 



MenpocO' 




20 



Base 25 



OCH2CH2CN 30 



The four dcoxynuclcosidcs were phosphitylatcd using 35 
either 2-cyanoethyl-N,N-diisopropyl 
cblorophosphoramidite, or 2-cyanoethyl-N,N,N\N f - 
tetraisop ropy Iphosphorodia mid ite. The following is a typi- 
cal procedure. Add 16.6 g (17.4 ml; 55 mmole) of 
2-cyanoethyl-N,N,N , ,N'-tetraisopropylphosphorodiamidite 40 
to a solution of 50 mmole S'-MeNPOC-nucleoside and 4.3 
g (25 mmole) diisopropylammonium tetrazolide in 250 mL 
dry CH 2 C1 2 under argon at ambient temperature. Continue 
stirring for 4-16 hours (reaction monitored by TLC: 
45:45:10 hexane/CH 2 Cl 2 /Et 3 N). Wash the organic phase 45 
with saturated aqueous NaHC0 3 and brine, then dry over 
Na 2 S0 4 , and evaporate to dryness. Purify the crude amiditc 
by flash chromatography (9x25 cm silica gel column eluted 
with hexane/CH 2 Cl/TEA -45:45: 10 for A, C, T; or 0:90:10 
for G). The yield of purified amidite is about 90%. 
II. PREPARATION OF LABELED DNA/ 
HYBRIDIZATION TO ARRAY 

1) PCR 

PCR amplification reactions are typically conducted in a 
mixture composed of per reaction: 1 fi\ genomic DNA; 10 /d 
each primer (10 pmol^l stocks); 10/il lOxPCR buffer (100 
mM Tris.Cl pH8.5, 500 mM KC1, 15 mM MgCl 2 ); 10//1 2 
mM dNTPs (made from 100 mM dNTP stocks); 2.5 U Taq 
polymerase (Perkin Elmer AmpliTaq™, 5 U//d); and H 2 0 to 
100 fA. The cycling conditions are usually 40 cycles (94° C. 
45 sec, 55° C. 30 sec, 72° C. 60 sec) but may need to be 
varied considerably from sample type to sample type. These 
conditions are for 0.2 mL thin wall tubes in a Perkin Elmer 
9600 thermocycler. See Perkin Elmer 1992/93 catalogue for 
9600 cycle time information. Target, primer length and 
sequence composition, among other factors, may also affect 
parameters. 



50 



55 



60 
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For products in the 200 to 1000 bp size range, check 2 /d 
of the reaction on a 1.5% 0.5xTBE agarose gel using an 
appropriate size standard (phiX174 cut with Haelll is 
convenient). The PCR reaction should yield several pico- 
moles of product. It is helpful to include a negative control 
(i.e., 1 jUl TE instead of genomic DNA) to check for possible 
contamination. To avoid contamination, keep PCR products 
from previous experiments away from later reactions, using 
filter tips as appropriate. Using a set of working solutions 
and storing master solutions separately is helpful, so long as 
one does not contaminate the master stock solutions. 

For simple amplifications of short fragments from 
genomic DNA it is, in general, unnecessary to optimize 
Mg 2+ concentrations. A good procedure is the following: 
make a master mix minus enzyme; dispense the genomic 
DNA samples to individual tubes or reaction wells; add 
enzyme to the master mix; and mix and dispense the master 
solution to each well, using a new filter tip each time. 

2) PURIFICATION 

Removal of unincorporated nucleotides and primers from 
PCR samples can be accomplished using the Promega 
Magic PCR Preps DNA purification kit. One can purify the 
whole sample, following the instructions supplied with the 
kit (proceed from section IIIB, 'Sample preparation for 
direct purification from PCR reactions*). After elution of the 
PCR product in 50 ft\ of TE or H 2 0, one centrifuges the 
eluate for 20 sec at 12,000 rpm in a microfuge and carefully 
transfers 45 fi\ to a new microfuge tube, avoiding any visible 
pellet. Resin is sometimes carried over during the elution 
step. This transfer prevents accidental contamination of the 
linear amplification reaction with * Magic PCR' resin. Other 
methods, e.g. size exclusion chromatography, may also be 
used. 

3) LINEAR AMPLIFICATION 

In a 0.2 mL thin-wall PCR tube mix: 4 /A purified PCR 
product; 2 /d primer (10 pmol//d); 4/d lOxPCR buffer; 4 /*1 
dNTPs (2 mM dA, dC, dG, 0.1 mM dT)i 4/d 0.1 mM dUTP; 
1 (A 1 mM fluorescein dUTP (Amersham RPN 2121); 1 U 
Taq polymerase (Perkin Elmer, 5 U//d); and add H 2 0 to 40 
/*L Conduct 40 cycles (92° C. 30 sec, 55° C. 30 sec, 72° C. 
90 sec) of PCR. These conditions have been used to amplify 
a 300 nucleotide mitochondrial DNA fragment but are 
generally applicable. Even in the absence of a visible 
product band on an agarose gel, there should still be enough 
product to give an easily detectable hybridization signal. If 
one is not treating the DNA with uracil DNA glycosylase 
(sec Section 4), dUTP can be omitted from the reaction. 

4) FRAGMENTATION 

Purify the linear amplification product using the Promega 
Magic PCR Preps DNA purification kit, as per Section 2 
above. In a 0.2 mL thin-wall PCR tube mix: 40 /<! purified 
labeled DNA; 4 /d lOxPCR buffer; and 0.5 y\ uracil DNA 
glycosylase (BRL lU//d). Incubate the mixture 15 min at 
37° C, then 10 min at 97° C; store at -20° C. until ready 
to use. 

5) HYBRIDIZATION SCANNING & STRIPPING 

A blank scan of the slide in hybridization buffer only is 
helpful to check that the slide is ready for use. The buffer is 
removed from the flow cell and replaced with 1 mL of 
(fragmented) DNA in hybridization buffer and mixed well. 
The scan is performed in the presence of the labeled target. 
FIG. 33 illustrates an illustrative detection system for scan- 
ning a DNA chip. A series of scans at 30 min intervals using 
a hybridization temperature of 25° C. yields a very clear 
signal, usually in at least 30 min to two hours, but it may be 
desirable to hybridize longer, i.e., overnight. Using a laser 
power of 50 /iW and 50 /*ra pixels, one should obtain 
maximum counts in the range of hundreds to low thousands/ 
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pixel for a new slide. When finished, the slide can be 
stripped using 50% formamide. rinsing well in deionized 
H 2 0, blowing dry, and storing at room temperature. 
III. PREPARATION OF LABELED RNA 
/HYBRIDIZATION TO ARRAY 5 
1) TAGGED PRIMERS 

The primers used to amplify the target nucleic acid should 
have promoter sequences if one desires to produce RNA 
from the amplified nucleic acid. Suitable promoter 
sequences are shown below and include: io 

(1) the T3 promoter sequence: 

5'-CGGAATTAACCCTCACTAAAGG (SEQ. ID NO:298) 
5'-AATTAACCCTCACTAAAGGGAG; (SEQ. ID NO:299) 

(2) the T7 promoter sequence: 

5' TAATACGACTCACTATAGGGAG; (SEQ. ID NO:300) 15 
and (3) the SP6 promoter sequence: 
5' ATTTAGGTGACACTATAGAA. (SEQ. ID NO:301) 
The desired promoter sequence is added to the 5' end of the 
PCR primer. It is convenient to add a different promoter to 
each primer of a PCR primer pair so that either strand may 20 
be transcribed from a single PCR product. 

Synthesize PCR primers so as to leave the DMT group on. 
DMT-on purification is unnecessary for PCR but appears to 
be important for transcription. Add 25 /d 0.5M NaOH to 
collection vial prior to collection of oligonucleotide to keep 25 
the DMT group on. Deprotcct using standard chemistry — 
55° C. overnight is convenient. 

II PLC purification is accomplished by drying down the 
oligonucleotides, resuspending in 1 mLO.lM TEAA (dilute 
2.0M stock in deionized water, filter through 0.2 micron 30 
filter) and filter through 0.2 micron filter. Load 0.5 mL on 
reverse phase HPLC (column can be a Hamilton PRP-1 
semi-prep, #79426). The gradient is 0—50% CH 3 CN over 
25 min (program 0.2 //mol.prep.0-50, 25 min). Pool the 
desired fractions, dry down, resuspend in 200 ju\ 80% HAc. 35 
30 min RT. Add 200 /d EtOH; dry down. Resuspend in 200 
/d H 2 0, plus 20//I NaAc pH5.5, 600 /d EtOH. Leave 10 min 
on ice; centrifuge 12,000 rpm for 10 min in microfugc. Pour 
off supernatant. Rinse pellet with 1 mL EtOH, dry, resuspend 
in 200 /d H20. Dry, resuspend in 200 //I TE. Measure A260, 40 
prepare a 10 pmol/^1 solution in TE (10 mM Tris.Cl pH 8.0, 
0.1 mM EDTA). Following HPLC purification of a 42 mer, 
a yield in the vicinity of 15 nmol from a 0.2 //mol scale 
synthesis is typical. 

2) GENOMIC DNA PREPARATION 45 
For obtaining genomic DNA from human hair, one can 

extract as few as 5 hairs, including hair roots. On a clean and 
sterile surface, one places the hair on a piece of parafilm, and 
after wiping a new razor blade with EtOH cutting off the 
roots, the roots are transferred to a 1.5 mL micro frige tube 50 
using a pair of Millipore forceps cleaned with EtOH. Add 
500 /d (10 mM Tris.Cl pH8.0, 10 mM EDTA, 100 mM NaCl, 
2% (w/v) SDS, 40 mM DTT, filter sterilized) to the sample. 
Add 1.25 /d 20 mg/ml proteinase K (Boehringer) Incubate at 
55° C. for 2 hours, vortexing once or twice. Perform 2x0.5 55 
raL 1:1 phenol: CHCI3 extractions. After each extraction, 
! centrifuge 12,000 rpm 5 min in a micro frige and recover 0.4 
mL supernatant. Add 35 /d NaAc pH5.2 plus 1 mL EtOH. 
Place sample on ice 45 min; then centrifuge 12,000 rpm 30 
min, rinse, air dry 30 min, and resuspend in 100 /d TE. 60 

3) PCR 

PCR is performed in a mixture containing, per reaction: 1 
p\ genomic DNA; 4/d each primer (10 pmol/^1 stocks); 4 /d 
10 xPCR buffer (100 mM Tris.Cl pH8.5, 500 mM KC1, 15 
mM MgCl 2 ); 4 /d 2 mM dNTPs (made from 100 mM dNTP 65 
stocks); 1 U Taq polymerase (Perkin Elmer, 5 U//d); H 2 0 to 
40jUl. About 40 cycles (94° C. 30 sec, 55° C. 30 sec, 72° C. 
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30 sec) are performed, but cycling conditions may need to 
be varied. These conditions are for 0.2 mL thin wall tubes in 
Perkin Elmer 9600. For products in the 200 to 1000 bp size 
range, check 2 /d of the reaction on a 1.5% 0.5xTBE agarose 
gel using an appropriate size standard. For larger or smaller 
volumes (20-100 /d), one can use the same amount of 
genomic DNA but adjust the other ingredients accordingly. 

4) IN VITRO TRANSCRIPTION 

Mix: 3 jul PCR product; 4 fA 5xbuffer; 2 /d DTT; 2.4 /d 10 
mM rNTPs (100 mM solutions from Pharmacia); 0.48 /d 10 
mM fluorescein-UTP (Fluorescein-12-UTP, 10 mM 
solution, from Boehringer Mannheim); 0.5 /d RNA poly- 
merase (Promega T3 or T7 RNA polymerase); and add H 2 0 
to 20/^1. Incubate at 37° C. for 3 h. Check 2 fA of the reaction 
on a 1.5% 0.5xTBE agarose gel using a size standard. 
Sxbuffer is 200 mM Tris pH 7.5, 30 mM MgCl^, 10 mM 
spermidine, 50 mM NaCl, and 100 mM DTT (supplied with 
enzyme). The PCR product needs no purification and can be 
added directly to the transcription mixture. A 20 fA reaction 
is suggested for an initial test experiment and hybridization; 
a 100 fA reaction is considered "preparative" scale (the 
reaction can be scaled up to obtain more target). The amount 
of PCR product to add is variable; typically a PCR reaction 
will yield several picomoles of DNA If the PCR reaction 
does not produce that much target, then one should increase 
the amount of DNA added to the transcription reaction (as, 
well as optimize the PCR). The ratio of fluorescein-UTP to 
UTP suggested above is 1:5, but ratios from 1:3 to 1:10 — all 
work well. One can also label with biotin-UTP and detect 
with streptavidin-FITC to obtain similar results as with 
fluorescein-UTP detection. 

For nondenaturing agarose gel electrophoresis of RNA, 
note that the RNA band will normally migrate somewhat 
faster than the DNA template band, although sometimes the 
two bands will comigrate. The temperature of the gel can 
effect the migration of the RNA band. The RNA produced 
from in vitro transcription is quite stable and can be stored 
for months (at least) at -20° C. without any evidence of 
degradation. It can be stored in unsterilized 6xSSPE 0.1% 
triton X- 100 at -20° C. for days (at least) and reused twice 
(at least) for hybridization, without taking any special pre- 
cautions in preparation or during use. RNase contamination 
should of course be avoided. When extracting RNA from 
cells, it is preferable to work very rapidly and to use strongly 
denaturing conditions. Avoid using glassware previously 
contaminated with RNases. Use of new disposable plas- 
ticware (not necessarily sterilized) is preferred, as new 
plastic tubes, tips, etc., are essentially RNase free. Treatment 
with DEPC or autoclaving is typically not unnecessary. 

5) FRAGMENTATION 

In a 0.2 mL thin-wall PCR tube mix: 18 fA RNA (direct 
from transcription reaction — no purification required); 18 fA 
H ? 0; and 4/d 1M Tris.Cl pH9.0. Incubate at 99.9° C for 60 
min. Add to 1 mL hybridization buffer and store at -20° C. 
until ready to use. The alkaline hydrolysis step is very 
reliable. The hydrolysed target can be stored at -20° C. in 
6xSSPE/0.1% Triton X-100 for at least several days prior to 
use and can also be reused. 

6) HYBRIDIZATION SCANNING, & STRIPPING 

A blank scan of the slide in hybridization buffer only is 
helpful to check that the slide is ready for use. The buffer is 
removed from the flow cell and replaced with 1 mL of 
(hydrolysed) RNA in hybridization buffer and mixed well. 
Incubate for 15-30 min at 18° C. Remove the hybridization 
solution, which can be saved for subsequent experiments. 
Rinse the flow cell 4-5 times with fresh changes of 6xSSPE/ 
0.1% Triton X-100, equilibrated to 18° C. The rinses can be 
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performed rapidly, but it is important to empty the flow cell 
before each new rinse and to mix the liquid in the cell 
thoroughly. The scan is performed in the presence of the 
labeled target. A series of scans at 30 min intervals using a 
hybridization temperature of 25° C. yields a very clear 
signal, usually in at least 30 min to two hours, but it may be 
desirable to hybridize longer, i.e., overnight. Using a laser 
power of 50 and 50 ^m pixels, one should obtain 
maximum counts in the range of hundreds to low thousands/ 
pixel for a new slide. When finished, the slide can be 
stripped using 50% to 100% formamide at 50° C. for 30 min, 
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rinsing well in deionized H 2 0, blowing dry, and storing at 
room temperature. 

These conditions are illustrative and assume a probe 
length of -15 nucleotides. The stripping conditions sug- 
gested are fairly severe, but some signal may remain on the 
slide if the washing is not stringent. Nevertheless, the counts 
remaining after the wash should be very low in comparison 
to the signal in presence of target RNA. In some cases, much 
gender stripping conditions are effective. The lower the 
hybridization temperature and the longer the duration of 
hybridization, the more difficult it is to strip the slide. Longer 
targets may be more difficult to strip than shorter targets. 



SEQUENCE LISTING 



( 1 ) GENERAL INFORMATION: 

(Mi) NUMBER OF SEQUENCES: 360 



( 2 ) INFORMATION FOR SEQ ID NO:l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pin 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE- DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TTGCTGACGT CAGCC 

1 5 

( 2 ) INFORMATION FOR SEQ ID NO:2: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

TTGCTGACAT CAGCC 

I 5 

( 2 ) INFORMATION FOR SEQ ID NO:3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

TTGCTGACCT CAGCC 

1 5 

( 2 ) INFORMATION FOR SEQ ID NO:4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
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TTGCTGACTT CAGCC 

( 2 ) INFORMATION FOR SEQ ID NO:5: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 39 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( * i ) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

CATTAAACAA AATATCATCT TTGGTGTTTC CTATGATGA 

( 2 ) INFORMATION FOR SEQ ID NO:6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

CATTAAAGAA AATATCATTG GTGTTTCCTA TGATGA 

( 2 ) INFORMATION FOR SEQ ID NO:7: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 36 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CATTAAAGAA AATATCATTG GTGTTTCCTA TGATGA 

( 2 ) INFORMATION FOR SEQ ID NO:R: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

AACACCAATG ATGAT 



( 2 ) INFORMATION FOR SEQ ID NO:9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CCAAAGATNA TATTT 



1 5 



( 2 ) INFORMATION FOR SEQ ID NO: 10: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ CD NO:lCh 

ACCAAAGANG ATATT 



( 2 ) INFORMATION FOR SEQ CD NO:ll: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CACCAAAGNT GATAT 



( 2 ) INFORMATION FOR SEQ ID NO: 12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( H ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

ACACCAAANA TGATA 



( 2 ) INFORMATION FOR SEQ ID NO:13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( U ) TYVEz nucleic acid 
( C ) STRANDEDNESS: .single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AACACCAANG ATGAT 



( 2 ) INFORMATION FOR SEQ ID NO: 14: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

AAACACCANA GATGA 



( 2 ) INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
GAAACACCNA agatg 



( 2 ) INFORMATION FOR SEQ ID NO:16: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L5 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

GGAAACACNA AAGAT 



( 2 ) INFORMATION FOR SEQ ID NO:17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

AGGAAACANC A A A G A 



( 2 ) INFORMATION FOR SEQ ID NO:18: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



( 2 ) INFORMATION FOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l9: 



( 2 ) INFORMATION FOR SEQ ID NO:20: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 44 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
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TAATACGACT CACTATAGGG AGATGACCTA ATAATGATGG GTTT 

( 2 ) INFORMATION FOR SEQ ID NO:21: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

TAATACGACT CACTATAGGG AGTAGTGTGA AGGGTTCATA TGC 

( 2 ) INFORMATION FOR SEQ ID NO:22: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 45 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ [D NO:22: 

CTCGGAATTA ACCCTCACTA AAGGTAGTGT GAAGGGTTCA TATGC 

( 2 ) INFORMATION FOR SEQ ID NO:23: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

TAATACGACT CACTATAGGG AGAGCATACT AAAAGTGACT CTC 

( 2 ) INFORMATION FOR SEQ ID NO:24: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 44 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

TAATACGACT CACTATAGGG AGACATGAAT GACATTTACA GCAA 

( 2 ) INFORMATION FOR SEQ ID NO:25: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 44 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

CGGAATTAAC CCTCACTAAA GGACATGAAT GACATTTACA GCAA 



( 2 ) INFORMATION FOR SEQ ID NO:26: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

TTTATGGGGT GA 



( 2 ) INFORMATION FOR SEQ ID NO:27: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

TTGATTTATG GG 



( 2 ) INFORMATION FOR SEQ ID NO:28: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( 11 ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:28: 

AACCTATTTG ATT 



( 2 ) INFORMATION FOR SEQ ID NO:29: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GGACCAAACC TA 



( 2 ) INFORMATION FOR SEQ ID NO:30: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 1.2 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

ACCCTAGGAC ca 



( 2 ) INFORMATION FOR SEQ ID NO:31: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ tD NO-.31: 
GGTGTGTGTG TGC 



( 2 ) INFORMATION FOR SEQ tD NO:32: 

( 1 ) SEQUENCE CHARACTER [STICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRAND EDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

CGGTGTGTGT GTGC 



( 2 ) INFORMATION FOR SEQ ID NO: 33: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

GGTGTGTGTG TGCT 



( 2 ) INFORMATION FOR SEQ ID NO:34: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

CTGGGTAGGA TG 



( 2 ) INFORMATION FOR SEQ ID NO:35: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

TGCTGGGTAG GA 



( 2 ) INFORMATION FOR SEQ ID NO:36: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( ! i ) MOLECULE TYPE: DNA (probe) 



( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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TGTGCTGGCT AG 

( 2 ) INFORMATION FOR SEQ ID NO:37: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

GTTAGCAOCG GT 

( 2 ) INFORMATION FOR SEQ ID NO:38: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

GGGTTAGCAG CG 

( 2 ) INFORMATION FOR SEQ ID NO:39: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

AGCGGGGGAG G 

( 2 ) INFORMATION FOR SEQ ID NO:40: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:40: 

AGCGGGGGAG 



( 2 ) INFORMATION FOR SEQ ID NO:4l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

CGTTGGTTCG G 



( 2 ) INFORMATION FOR SEQ ID NO:42: 
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( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

GGGTTTGGTT GG 



( 2 ) INFORMATION FOR SEQ ID NO:43: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

GATCTTTGGC GT 



( 2 ) INFORMATION FOR SEQ ID NO:44: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( W ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

GGGTGATCTT TG 



( 2 ) INFORMATION FOR SEQ ID NO:45: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( [1 ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

TGTGGGGGGT GA 



( 2 ) INFORMATION FOR SEQ ID NO:46: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 

( M ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

TAAACTGTGG GG 



( 2 ) INFORMATION FOR SEQ ID NO:47: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



5,837,832 

53 

-continued 



( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
GCTACATAAA CTG 



( 2 ) INFORMATION FOR SEQ ID NO: 48: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

GAGGTAAGCT ACA 



( 2 ) INFORMATION FOR SEQ ID NO:49: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

GAGGAGGTAA GC 



( 2 ) INFORMATION FOR SEQ ID NO:50: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ tD NO:50: 

TGCTTTGAGG AG 



( 2 ) INFORMATION FOR SEQ ID NO;51: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

AGTGTATTGC TTT 



( 2 ) INFORMATION FOR SEQ ID NO:52: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
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CATTTTCAGT GTA 



( 2 ) INFORMATION FOR SEQ ID NO:53: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS : single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

TAAACATTTT CAG 



( 2 ) INFORMATION FOR SEQ ID NO:54: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:54: 

AGCCCGTCTA AA 



( 2 ) INFORMATION FOR SEQ ID NO:55: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

GAGCCCGTCT AA 



( 2 ) INFORMATION FOR SEQ ID NO:56: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

TGATGTGAGC CC 



( 2 ) INFORMATION FOR SEQ ID NO:S7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

GGGGTGATGT GA 



( 2 ) INFORMATION FOR SEQ ID NO:58: 
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( i ) SEQUENCE CHARACTER [STICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nnclcic acid 
( C ) STRANDED NESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

GAGTGOGAGG G 



( 2 ) INFORMATION FOR SEQ ID NO:59: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

GT ATGGGAGT GG 



( 2 ) INFORMATION FOR SEQ ID NO:60: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L4 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i j ) MOLECULE TYPE: DNA (piobe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

GATTAGTAGT ATGG 



( 2 ) INFORMATION FOR SEQ ID NO:6l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ [D NO:61: 

TGAATGAGAT TAG 



( 2 ) INFORMATION FOR SEQ ID NO:62: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

ATTGAATGAG ATT 



( 2 ) INFORMATION FOR SEQ ID NO:63: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nnclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
(ji ) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
GGGTTGTATT GAA 

( 2 ) INFORMATION FOR SEQ ID NO:64: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:64: 

GCGGGGGTTG 

( 2 ) INFORMATION FOR SEQ ID NO:65: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:65: 

ATGGGCGGGG 

( 2 ) INFORMATION FOR SEQ ID NO:66: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

T ACGATGGGC G 

( 2 ) INFORMATION FOR SEQ ID NO:67: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

TGGGTAGGAT GG 

( 2 ) INFORMATION FOR SEQ ID NO:68: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:68: 
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GTGCTGOGTA GG 

( 2 ) INFORMATION FOR SEQ ID NO:69: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

TGTGTGTGCT GG 

( 2 ) INFORMATION FOR SEQ ID NO:70: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

GCGGTGTGTG TG 

( 2 ) INFORMATION FOR SEQ ID NO:71: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:71: 

TAGCAGCGGT GT 

( 2 ) INFORMATION FOR SEQ ID NO:72: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

TGGGGTTAGC AG 

( 2 ) INFORMATION FOR SEQ ID NO:73: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

GGTATGGGGT TA 



( 2 ) INFORMATION FOR SEQ ID NO:74: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRAND EDNES S : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

GTTCGGGGTA TG 



( 2 ) INFORMATION FOR SEQ ID NO:75: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:75: 

GCTGGTGTTA GG 



( 2 ) INFORMATION FOR SEQ ID NO: 76: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (piobe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

GGTTAGGCTG GT 



( 2 ) INFORMATION FOR SEQ ID NO:77: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L3 base pairs 
( 11 ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

AAATCTGGTT AGG 



( 2 ) INFORMATION FOR SEQ ID NO:78: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L3 base pairs 
( B ) TYPE: nucleic seid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

AAATTTGAAA TCT 



( 2 ) INFORMATION FOR SEQ ID NO:79: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

1 3 

AJVG AT A A A A T TTG 

( 2 ) INFORMATION FOR SEQ ID NO:80: 

( t ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:80: 

1 2 

GCCAAAAAGA TA 

( 2 ) INFORMATION FOR SEQ ID NO:81: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:81: 

1 1 

CGCCAAAAAG A 

( 2 ) INFORMATION FOR SEQ ID NO:82: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:82: 

1 1 

CATACCCCCA A 

( 2 ) INFORMATION FOR SEQ ID NO:83: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:83: 

1 3 

AAAACTGCAT ACC 



( 2 ) INFORMATION FOR SEQ ID NO:84: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 




( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:84: 
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TGTTAAAAGT GCA 



( 2 ) INFORMATION FOR SEQ ID NO:85: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

GGGTGACTGT T A A 



( 2 ) INFORMATION FOR SEQ ID NO:86: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( J i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

GGGGGTGACT GT 



( 2 ) INFORMATION FOR SEQ ID NO:87: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:87: 

AGTTGGGGGG T 



( 2 ) INFORMATION FOR SEQ ID NO:88: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

TGTGTTAGTT GGG 



( 2 ) INFORMATION FOR SEQ ID NO:89: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:89: 

AAAATAATGT GTT 



( 2 ) INFORMATION FOR SEQ ID NO:90: 
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( i ) SEQUENCE CHARACTEfUSTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:90: 

AGCCGAAAAT AA 



( 2 ) INFORMATION FOR SEQ ID NO:91: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:91: 

GGAGGGGAAA AT 



( 2 ) INFORMATION FOR SEQ ID NO:92: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( 1) ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:92: 

GGAAATTTTT TG 



( 2 ) INFORMATION FOR SEQ ID NO:93: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

GGTGGAAATT TT 



( 2 ) INFORMATION FOR SEQ ID NO:94: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGI"H: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

GGTTTGGTGG A 



( 2 ) INFORMATION FOR SEQ ID NO:95: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:95: 
GAGGGGGGGT T 



( 2 ) INFORMATION FOR SEQ ID NO:96: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: LO base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: (bear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:96: 

GCGGGGGAGG 



( 2 ) INFORMATION FOR SEQ ID NO:97: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ 10 NO:97: 

CAGAAGCGGG G 



( 2 ) INFORMATION FOR SEQ ID NO:98: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( k i ) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

GTAGGCCAGA AG 



( 2 ) INFORMATION FOR SEQ ID NO:99: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:99: 

GTGCTGTAGG CC 



( 2 ) INFORMATION FOR SEQ ID NO:lOQ: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:100: 
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TGTTTAAGTG CTG 



( 2 ) INFORMATION FOR SEQ ID NO: 101: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:i01: 



( 2 ) INFORMATION FOR SEQ ID NO: 102: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( | i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 



( 2 ) INFORMAriON FOR SEQ ID NO: 103: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( M ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:103: 



( 2 ) INFORMATION FOR SEQ ID NO: 104: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: LI base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 



( 2 ) INFORMATION FOR SEQ ID NO: 105: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:10S: 



( 2 ) INFORMATION FOR SEQ ID NO:106: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( jt i ) SEQUENCE DESCRIPTION: SEQ ID NO:l06: 

TTTGTTTTTG GG 



( 2 ) INFORMATION FOR SEQ ID NO: 107: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ tt> NO:t07: 

GGGTTCTTTG TT 



( 2 ) INFORMATION FOR SEQ ID NO:10& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

GTGTTAGGGT TCT 



( 2 ) INFORMATION FOR SEQ ID NO: 109: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:109: 

TTTAGTAAGT ATGT 



( 2 ) INFORMATION FOR SEQ ID NO:110: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:110: 

AAC ACACTTT AGT 



( 2 ) INFORMATION FOR SEQ ID NO:Lll: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 1 4 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ED NOtLII: 
AATTAATTAA CACA 



( 2 ) INFORMATION FOR SEQ ID NO:112: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( jt i ) SEQUENCE DESCRIPTOR SEQ ID NO:ll2: 

AAGCATTAAT TAA 



( 2 ) INFORMATION FOR SEQ ID NO:113: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:113: 

GTCCTACAAC CAT 



( 2 ) INFORMATION FOR SEQ ID NO: 114: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 1 14: 

TGTCCTACAA GCA 



( 2 ) INFORMATION FOR SEQ ID NO.U5: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: US: 

ATTATTATGT CCT 



( 2 ) INFORMATION FOR SEQ ID NO:U6: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:ll6: 
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TTGTTATTAT TATG 



( 2 ) INFORMATION FOR SEQ ID NO:L17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:U7: 

ATTCAAATTG TTA 



( 2 ) INFORMATION FOR SEQ ID NO: 118: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L3 base pairs 
( B ) TYPE: nucleic acid 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:118: 

GCAGACATTC AAA 



( 2 ) INFORMATION FOR SEQ ID NO: 119: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:ll9: 

GCTGTCCAGA CA 



( 2 ) INFORMATION FOR SF.Q ID NO:12ft 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: uucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

AAAGTGGCTG TG 



( 2 ) INFORMATION FOR SEQ ID NO: 121: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:121: 

TGTGTGGAAA G TG 



( 2 ) INFORMATION FOR SEQ ID NO: 122: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

GATGTCTGTG TGG 



( 2 ) INFORMATION FOR SEQ CD NO: 123: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

ATGATGTCTG TGT 



( 2 ) INFORMATION FOR SEQ [D NO: 124: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:124: 

TTTTGTTATG ATG 



( 2 ) INFORMATION FOR SEQ ID NO: 125: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( H ) TYPE- nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:125: 

TTTTTTGTTA TGA 



( 2 ) INFORMATION FOR SEQ ID N0:126: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATAGGGTGCT CC 



( 2 ) INFORMATION FOR SEQ ID NO: 127; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:127: 
GCGACATAGG GT 



( 2 ) INFORMATION FOR SEQ ID NO:12&: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:128: 

TACTGCGACA TAG 



( 2 ) INFORMATION FOR SEQ ID NO: 129: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:129: 

GACAGATACT gcg 



{ 2 ) INFORMATION FOR SEQ ID NO: 130: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

AATCAAAGAC AGA 



( 2 ) INFORMATION FOR SEQ ID NO: 131: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l3l: 

AGGAATCAAA GAC 



( 2 ) INFORMATION FOR SEQ ID NO: 132: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOI.ECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 
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TGAGGCAGGA AT 



( 2 ) INFORMATION FOR SEQ ID NO:l33: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ED NO:133: 

AGGATCACGC AG 



( 2 ) INFORMATION FOR SEQ ID NO: 134: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

AAATAATAGG ATG 



( 2 ) INFORMATION FOR SEQ ID NO:135: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:135: 

GCGATAAATA AT 



( 2 ) INFORMATION FOR SEQ ID NO: 136: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:136: 

TAGGATGCGA TA 



( 2 ) rNFORMATION FOR SEQ ID NO: 137: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:137: 

GTAGGATGCG AT 



( 2 ) INFORMATION FOR SEQ ID NO:13& 
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( l ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ CD NO:138: 

TTCAACOTAG ga 



( 2 ) INFORMATION FOR SEQ ID NO:139: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ED N0:139: 

AATATTGAAC GTA 



( 2 ) INFORMATION FOR SEQ ID NO: 140: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( II ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:140: 

GCCTGTAATA TTC 



( 2 ) INFORMATION FOR SEQ ID NO: 141: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( 11 ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:141: 

TGTTCGCCTG TA 



( 2 ) INFORMATION FOR SEQ ID NO: 142: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(it) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

GTATGTTCGC CT 



( 2 ) INFORMATION FOR SEQ ID NO: 143: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:143: 
CTCCCGTGAG TO 



( 2 ) INFORMATION FOR SEQ ID NO: 144: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:144: 

GAGAGCTCCC GT 



( 2 ) INFORMATION FOR SEQ ID NO: 145: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l45: 

ATGGAGAGCT CC 



( 2 ) INFORMATION FOR SEQ ID NO: 146: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

AATGCATGGA G A 



( 2 ) INFORMATION FOR SEQ ID NO:t47: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

ATACCAAATG CA 



( 2 ) INFORMATION FOR SEQ ID NO:14& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:148: 
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GACGAAAATA CCA 

( 2 ) INFORMATION FOR SEQ ID NO: 149: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: lines* 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:149: 

CCCAGACGAA A 

( 2 ) INFORMATION FOR SEQ ID NO:150: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:150: 

TACCCCCCAG A 

( 2 ) INFORMATION FOR SEQ ID NO:15l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: LI base pairs 
< B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

TGCATACCCCC 

( 2 ) INFORMATION FOR SEQ ID NO: 152: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

TCGCGTGCAT AC 



( 2 ) INFORMATION FOR SEQ ID NO: 153: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:153: 

GACTATCGCG TG 



( 2 ) INFORMATION FOR SEQ ID NO: 154: 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ATCACTATCG cg 



( 2 ) INFORMATION FOR SEQ ID NO:155: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:155: 

CTCGCAATGA CT 



( 2 ) INFORMATION FOR SEQ ID NO: 156: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE; nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

CGTCTCGCAA TG 



( 2 ) INFORMATION FOR SEQ ID NO:157 : 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( U ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ [DNO:157: 

CTCCAGCGTC TC 



( 2 ) INFORMATION FOR SEQ ID NO:15& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

TCCGGCTCCA G 



( 2 ) INFORMATION FOR SEQ ID NO: 159: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( j i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:159: 
GTGCTCCGGC T 



( 2 ) INFORMATION FOR SEQ ID NO:160: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:L60: 

GACCCTGAAG TAG 



( 2 ) INFORMATION FOR SEQ ID NO:161: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:161: 

TTTATGACCC TGA 



( 2 ) INFORMATION FOR SEQ ID NO: 162: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 162; 

TTTAGGCTTT ATG 



( 2 ) INFORMATION FOR SEQ ID NO: 163: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:163: 

GCTATTTAGG CT 



( 2 ) INFORMATION FOR SEQ ID NO:164: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:164: 
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TGGGCTATTT AG 



( 2 ) INFORMATION FOR SEQ ID NO:165: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:165: 



( 2 ) INFORMATION FOR SEQ ID N0:166: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



( 2 ) INFORMATION FOR SEQ ID NO: 167: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 



( 2 ) INFORMATION FOR SEQ ID NO: 16ft: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( B ) TYPE- nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 



( 2 ) INFORMATION FOR SEQ ID NO: 169: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 



( 2 ) INFORMATION FOR SEQ ID NO:170. 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:170: 

TCCATCGTGA TG 



( 2 ) INFORMATION FOR SEQ ID NO:L71: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( J i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 



( 2 ) INFORMATION FOR SEQ ID NO: 172: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



( 2 ) INFORMATION FOR SEQ ID NO: 173: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( II ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:173: 



( 2 ) INFORMATION FOR SEQ ID NO: 174: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



( 2 ) INFORMATION FOR SEQ ID NO: 175: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( X i ) SEQUENCE DESCRIPTION: SEQ ID N0:175: 
TGGTTAATAG GG 



( 2 ) INFORMATION FOR SEQ ID NO: 176: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DMA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:176: 

GTGAGTGGTT A A T 



(, 2 ) INFORMATION FOR SEQ ID NO: 177: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:177: 

TGTGCGGGAT AT 



( 2 ) INFORMATION FOR SEQ ID NO: 178: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 17ft: 

ACTCTTGTGC GG 



( 2 ) INFORMATION FOR SEQ ID NO: 179: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

TAGCACTCTT GTG 



( 2 ) INFORMATION FOR SEQ ID NO:180: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:180: 
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GGAGAGTAGC ACT 



( 2 ) INFORMATION FOR SEQ. ID NO:i81: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:181: 

GC GAGG AG AG TA 



( 2 ) INFORMATION FOR SEQ ID NO: 182: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

CGGAGCGAGG A 

1 1 



( 2 ) INFORMATION FOR SEQ II) NO: 183: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( J i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

GGCCCGGAGC 

1 0 



( 2 ) INFORMATION FOR SEQ ID NO: 184: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:184: 

TTATGGGCCC G 

1 1 

( 2 ) INFORMATION FOR SEQ [D NO: 185: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l85: 

AGTGTTATGG GC 



( 2 ) INFORMATION FOR SEQ ID NO:I8<5: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( jt i ) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

TACCCCCAAG TG 



( 2 ) INFORMATION FOR SEQ ID NO:187: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (piobe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:187: 

TTTAGCTACC CC 



( 2 ) INFORMATION FOR SEQ ID NO: 188: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

TTCACTTTAG CTA 



( 2 ) INFORMATION FOR SEQ ID NO: 189: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:189: 

TACAGTTCAC TTT 



( 2 ) INFORMATION FOR SEQ ID NO: 190. 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

TCGAOATACA GTT 



( 2 ) INFORMATION FOR SEQ ID NO: 191: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 
CAGATGTCGA GAT 13 



( 2 ) INFORMATION FOR SEQ ID NO: 192: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:192: 



( 2 ) INFORMATION FOR SEQ ID NO: 193: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l93: 



( 2 ) INFORMATION FOR SEQ ID NO: 194: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 194; 



( 2 ) INFORMATION FOR SEQ ID NO: 195: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.L95: 



( 2 ) INFORMATION FOR SEQ ID NO: 196: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x 1 ) SEQUENCE DESCRIPTION: SEQ IDNO:196: 
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AGGCATTTGA CT 

( 2 ) INFORMATION FOR SEQ ID NO:197: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:L97: 

AC G AG A AGGG AT 

( 2 ) INFORMATION FOR SEQ ID NO:198: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:198: 

TGGGGACGAG AA 

( 2 ) INFORMATION FOR SEQ ID NO: 199: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

ATCCATGGGG AC 

( 2 ) INFORMATION FOR SEQ ID NO:20ft 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

GGTCATCCAT GG 



( 2 ) INFORMATION FOR SEQ ID NO:201: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:201: 

AGGGGGGTCA T 



( 2 ) INFORMATION FOR SEQ ID NO:202: 
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( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRAKDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: D.VA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:202: 

TATCTGAGGG GG 



( 2 ) INFORMATION FOR SEQ ID NO:203: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:203: 



( 2 ) INFORMATION FOR SEQ ID NO:204: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:204: 



( 2 ) INFORMATION FOR SEQ ID NO:205: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:205: 



( 2 ) INFORMATION FOR SEQ ID NO:206: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:206: 



( 2 ) INFORMATION FOR SEQ ID NO:207; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:207: 
AGGATGGTGG TC 

( 2 ) INFORMATION FOR SEQ ID NO:20& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

ACACGGAGGA TG 



( 2 ) INFORMATION FOR SEQ CD NO:209: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:209: 

TGATTTACAC GG 



( 2 ) INFORMATION FOR SEQ ID NO:210: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

{ x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2 10: 



( 2 ) rNFORMATION FOR SEQ ID NO:2U: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:211: 



( 2 ) INFORMATION FOR SEQ ID N0:212: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:212: 
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AGGGGTCGCA T 

( 2 ) INFORMATION FOR SEQ ID NO:213: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: U base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

GGTG AGGGGT G 

( 2 ) INFORMATION FOR SEQ ID N0:214: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2U: 

AGTGGGTGAG GG 

( 2 ) INFORMATION FOR SEQ [[) N():215: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:215: 

GTATCCTAGT GGG 

( 2 ) INFORMATION FOR SEQ ID N0:216: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:216: 

TTTGTTGGTA TCC 



( 2 ) INFORMATION FOR SEQ ID NO:217: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N0:217: 

GTAGGTTTGT TGC 



( 2 ) INFORMATION FOR SEQ ID NO:2l& 
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( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

TGGGTAOGTT TG 



( 2 ) INFORMATION FOR SEQ ID NO:219: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:219: 



( 2 ) INFORMATION FOR SEQ ID NO:220: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( II ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:220: 



( 2 ) INFORMATION FOR SEQ ID NO:221: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 14 base pairs 
( It ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:22L: 



( 2 ) INFORMATION FOR SEQ ID NO:222: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:222: 



( 2 ) INFORMATION FOR SEQ ID NO:223: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i j ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:223: 
AAATGGCTTT AT 

( 2 ) INFORMATION FOR SEQ ID NO:224: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( jc i ) SEQUENCE DESCRIPTION: SEQ ID NO:224: 

GGTAAATGGC TT 

( 2 ) INFORMATION FOR SEQ ID NO:225: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:225: 

TGTACGGTAA ATG 



( 2 ) INFORMATION FOR SEQ ID NO:226: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:226: 

GTGCTAATGT ACG 



( 2 ) INFORMATION FOR SEQ ID NO:227: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:227: 



( 2 ) INFORMATION FOR SEQ ID NO:22& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:228: 
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CATGGGGAGG G 



( 2 ) INFORMATION FOR SEQ ID NO:229: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 1 ) MOLECULE TYPE: DNA (probe) 

(k!) SEQUENCE DESCRIPTION: SEQ ID NO:229: 



( 2 ) INFORMATION FOR SEQ ID NO:230: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:230: 



( 2 ) INFORMATION FOR SEQ ID NO:23t: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:231: 



( 2 ) INFORMATION FOR SEQ ID NO:232: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:232: 



( 2 ) INFORMATION FOR SEQ ID NO:233: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:233: 



( 2 ) INFORMATION FOR SEQ ID NO:234: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:234: 

TTGAGGGTTC AT 

( 2 ) INFORMATION FOR SEQ ID NO:235: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:235: 

GTGATAGTTG AGO 

( 2 ) INFORMATION FOR SEQ ID NO:236: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:236: 

TTGATGTCTG ATA 



( 2 ) INFORMATION FOR SEQ ID NO:237; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( R ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:237: 

TGCAGTTGAT GTG 



( 2 ) INFORMATION FOR SEQ ID NO:23& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:238: 



( 2 ) INFORMATION FOR SEQ ID NO:239: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 ba*e pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:239: 
ATTTGGAGTT GC 

( 2 ) INFORMATION FOR SEQ ID NO:240: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRAND EDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

TACCGTACAA TAT 

( 2 ) INFORMATION FOR SEQ ID NO:241: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:241: 

TGGTACCGTA CAA 



( 2 ) INFORMATION FOR SEQ ID NO:242: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:242: 

TATTTATGGT ACC 



( 2 ) INFORMATION FOR SEQ ID NO:243: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:243: 



( 2 ) INFORMATION FOR SEQ ID NO:244: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:244: 
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TACACGTGGT CAA 



( 2 ) INFORMATION FOR SEQ ID NO:245: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:245: 

ATGTACTACA GGT 



( 2 ) INFORMATION FOR SEQ ID NO:246: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(It) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 2 46: 

GGTTTTTATG TAC 



( 2 ) INFORMATION FOR SEQ ID NO:247: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:247: 



( 2 ) INFORMATION FOR SEQ ID NO:24R: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:248: 



( 2 ) INFORMATION FOR SEQ ID NO:249: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:249: 



( 2 ) INFORMATION FOR SEQ ID NO:250: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x t ) SEQUENCE DESCRIPTION: SEQ ED NO:250: 

GGGTTTTGAT GT 



( 2 ) INFORMATION FOR SEQ ID NO:251: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: LI base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( k i ) SEQUENCE DESCRIPTION: SEQ ID NO:251: 



( 2 ) INFORMATION FOR SEQ ID NO:252: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:252: 



( 2 ) INFORMATION FOR SEQ ID NO:253: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( II ) 1YPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:253: 



( 2 ) INFORMATION FOR SEQ ID NO:254: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:254: 



( 2 ) INFORMATION FOR SEQ ID NO:255: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
{ B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:255: 
TGTTGATGGG TG 1 2 



( 2 ) INFORMATION FOR SEQ ID NO:256: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:256: 



( 2 ) INFORMATION FOR SEQ ID NO:257: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:257: 



( 2 ) INFORMATION FOR SEQ [D NO:25& 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ !D NO:25ft: 



( 2 ) INFORMATION FOR SEQ ID NO:259: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:259: 



( 2 ) INFORMATION FOR SEQ ID NO:260s 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:260: 



133 



5,837,832 

-continued 



134 



GCAGTAATGT A C G 

( 2 ) [NFORMATION FOR SEQ ID NO:261: 

( j ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRAND EDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:261: 

TGGCTGGCAG TA 

( 2 ) INFORMATION FOR SEQ ID NO:262: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

TCATGGTGGC TG 



( 2 ) INFORMATION FOR SEQ ID NO:263: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:263: 

ACAATATTCA TGG 



( 2 ) INFORMATION FOR SEQ tD NO:264: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:264: 



( 2 ) INFORMATION FOR SEQ ID NO:265: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L3 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:265: 



( 2 ) INFORMATION FOR SEQ ID NO:266: 



135 



5,837,832 



-continued 



136 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( 0 ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:266: 

GAATAAGTTT AAA 



( 2 ) INFORMATION FOR SEQ ID NO:267: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:267: 



( 2 ) INFORMATION FOR SEQ ID NO:26& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:268: 



( 2 ) INFORMATION FOR SEQ ID NO:269: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) 1YCE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2<59: 



( 2 ) INFORMATION FOR SEQ ID NO:270: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:270: 



( 2 ) INFORMATION FOR SEQ ID NO:271: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ CD NO:27l: 
ATCTGCTTCC CC 



( 2 ) INFORMATION FOR SEQ DO NO:272: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:272: 

CAAATCTGCT TC 



( 2 ) INFORMATION FOR SEQ ID NO:273: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:273: 



( 2 ) INFORMATION FOR SEQ ID NO:274: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:274: 



( 2 ) INFORMATION FOR SEQ ID NO:275: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:27S: 



( 2 ) INFORMATION FOR SEQ ID NO:276: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:276: 
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TGGAAAAAGG TT 

( 2 ) INFORMATION FOR SEQ ID NO:277: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

GTCCTTGGAA AA 

( 2 ) INFORMATION FOR SEQ ID NO:27& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:278: 

ATTTGTCCTT GG 

( 2 ) INFORMATION FOR SEQ ID N():279: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:279: 

CTCTGATTTG TCC 

( 2 ) INFORMATION FOR SEQ ID NO:280: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:280: 

TTTTTCTCTG ATT 



( 2 ) INFORMATION FOR SEQ ID NO:28l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2Sl: 

TAAAGACTTT TTC 



( 2 ) INFORMATION FOR SEQ ID NO:282: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

GTGGAGTTAA AGA 



( 2 ) INFORMATION FOR SEQ ID NO:283: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 13 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2S3: 



( 2 ) INFORMATION FOR SEQ ID NO:284: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:284: 



( 2 ) INFORMATION FOR SEQ ID NO:2S5: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( R ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:285: 



( 2 ) INFORMATION FOR SEQ ID NO:286: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY; linear 

( I 1 ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:286: 



( 2 ) INFORMATION FOR SEQ ID NO:287; 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:287: 
TCTTAGCTTT GG 



( 2 ) INFORMATION FOR SEQ ID NO:28& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRAND ED NESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:288: 

CACTTGTGCC CTGACTTTCA AC 



( 2 ) INFORMATION FOR SEQ ID NO:289: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 49 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ 11) NO:289: 

AT GCAATTAA CCCTCACTAA AGGGAGACAC TTGTGCCCTG ACTTTCAAC 49 



( 2 ) INFORMATION FOR SEQ ID NO:290: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 25 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:290: 

GAC C CTGGG C AACCAGCCCT GTCGT 



( 2 ) INFORMATION FOR SEQ ID NO:291: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 47 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:291: 

TAATACCACT CACTATAGGG AGGACCCTGG GCAACCAGCC CTGTCGT 47 



( 2 ) INFORMATION FOR SEQ ID NO:292: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 25 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(it) MOLECULE TYPE: DNA (probe) 



( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:292: 
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GTAGAATTCT GTTGACTCAG ATTGG 



( 2 ) INFORMATION FOR SEQ ID NO:293: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

AAATCCATAC AATACTCCAG TATTTGC 



( 2 ) [NFORMATION FOR SEQ ID NO:294: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:294: 



( 2 ) INFORMATION FOR SEQ If) NO:2*)5: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 28 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:295: 



( 2 ) [NFORMATION FOR SEQ ID NO:296: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: L2 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:296: 



( 2 ) INFORMATION FOR SEQ ID NO:297: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:297: 



( 2 ) INFORMATION FOR SEQ ID NO:298: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( 3t 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:298: 

CGGAATTAAC CCTCACTAAA GG 

( 2 ) INFORMATION FOR SEQ ID NO:299: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: stogie 
( D ) TOPOLOGY: linear 

( i j ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:299: 

AATTAACCCT CACTAAAGCG AG 

( 2 ) INFORMATION FOR SEQ ID NO:300: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:300: 

TAATACGACT CACTATAGGG AG 



( 2 ) INFORMATION FOR SEQ ID NO:301: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( il ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:30L: 

ATTTAGGTGA CACTATAGAA 



( 2 ) INFORMATION FOR SEQ ID NO:302: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:302: 



( 2 ) INFORMATION FOR SEQ ID NO:303: 

( i ) SEQUENCE CHARACTERISTICS; 

( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



5,837,832 

149 150 

•continued 



( i i ) MOLECULE TYPE: DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ED NO:303: 
AGANGATATT 10 

( 2 ) INFORMATION FOR SEQ ID NO:304: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:304: 

AAGNT GATAT 1 0 



( 2 ) INFORMATION FOR SEQ ID NO:305: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:305: 



( 2 ) INFORMATION FOR SEQ tD NO:306: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:306: 



( 2 ) INFORMATION FOR SEQ ID NO:307: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:307: 



( 2 ) INFORMATION FOR SEQ ID NO:308: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:308: 
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ACCNAACATG 

( 2 ) INFORMATION FOR SEQ ID NO:309: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:309: 

C ACNAAAGAT 

( 2 ) INFORMATION FOR SEQ ID NO:310: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:310: 

AGAAACNACA 

( 2 ) INFORMATION FOR SEQ ID NO:3ll: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 16 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:311: 

ATTTCATTCT GTATTG 

( 2 ) INFORMATION FOR SEQ ID N0:312: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 16 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:312: 

CCGACTGCAG TCGTTA 



( 2 ) INFORMATION FOR SEQ ID NO:3t3: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( I i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:313: 

CCGACTGCAG TCGTT 



1 5 



( 2 ) INFORMATION FOR SEQ ID NO:3U: 
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( I ) SEQUENCE CHARACTER [STJCS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE onclcic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO-.314: 

CCCACTACAG TCGTT 



( 2 ) INFORMATION FOR SEQ ID NO:315: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pair* 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:315: 

CCGACTCCAG TCGTT 



( 2 ) INFORMATION FOR SEQ ID NO:316: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 15 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:316": 



( 2 ) INFORMATION FOR SEQ ID NO:317: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:317: 



( 2 ) INFORMATION FOR SEQ ID NO:31& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:318: 



( 2 ) INFORMATION FOR SEQ ID NO:319: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (oligonucleotide) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:319: 

CATTAAAGAA aatatcattg gtgtttccta tg 



( 2 ) INFORMATION FOR SEQ ID NO:320: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 18 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:320: 

CATTAAAGAA AATATCAT 



( 2 ) INFORMATION FOR SEQ ID NO:321: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x i ) SEQUENCE DESCRIPI ION: SEQ ID NO:32l: 

TATTAAAGAA AATATCATCT TTGGTGTTTC CTATC 



( 2 ) INFORMATION FOR SEQ ID NO:322: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

CCTTAAAGAA AATATCATCT TTGGTGTTTC CTAAA 35 



( 2 ) INFORMATION FOR SEQ ID NO:323: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic ocid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucletidc) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:323: 

CTTTAAAGAA AATAAAAAAA TTGGTGTTTC CTAAA 



( 2 ) INFORMATION FOR SEQ ID NO:324: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:324: 
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GGAAGTCTCC CATTTTAATT 



( 2 ) INFORMATION FOR SEQ ED NO:325: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH. 20 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:325: 

CCTTCAGAGG GTAAAATTAA 



( 2 ) INFORMATION FOR SEQ ID NO:326: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:326: 



( 2 ) INFORMATION FOR SEQ ED NO:327: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:327: 



( 2 ) INFORMATION FOR SEQ ID NO:32R: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:328: 



( 2 ) INFORMATION FOR SEQ ID NO:329: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:329: 



( 2 ) INFORMATION FOR SEQ ID NO:330: 
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( i ) SEQUENCE CHARACTER [STICS: 
( A ) LENGTH: 19 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRAND ED NESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE- DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ tD NO:330: 

GATTCAGAGT GTAAAATAC 

( 2 ) INFORMATION FOR SEQ ID NO:331: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:331: 

AAAAAAGAGT GTAAAATGA 

( 2 ) INFORMATION FOR SEQ ID NO:332: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x i ) SEQUENCE DESCRIPTION: SEQ ED NO:332: 

CATTAAAGAA AATAACATCA TTGGTGTTTC CTATG 



( 2 ) INFORMATION FOR SEQ [D NO:333: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 648 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO:333: 

AACAAACCTA CCCACCCTTA ACAGTACATA GTACATAAAG CCATTTACCG TACATAGCAC 60 

ATTACAGTCA AATCCCTTCT CGTCCCCATG GATGACCCCC CTCAGAT AGG GGTCCCTTGA 120 

CCACCATCCT CCGTGAAATC AATATCCCGC ACAAGAGTGC TACTCTCCTC GCTCCGGGCC 180 

CATAACACTT GGGGGTAGCT AAAGTGAACT GTATCCGACA TCTGGTTCCT ACTTCAGGGT 240 

CATAAAGCCT AAATAGCCCA CACGTTCCCC TTAAATAAGA CATCACGATG GATCACAGGT 300 

CTATCACCCT ATTAACCACT CACGGGAGCT CTCCATGCAT TTGGTATTTT CGTCTGGGGG 360 

GTATGCACGC GATAGCATTC CGAGACGCTG OAGCCGGAGC ACCCTATGTC GCAGTATCTG 420 

TCTTTGATTC CTGCCTCATC CTATTATTTA TCGCACCTAC GTTCAATATT ACAGGCGAAC 480 

ATACTTACTA AAGTGTGTTA ATTAATTAAT GCTTGTAGGA CATAATAATA ACAATTCAAT 540 

GTCTGCACAG CCACTTTCCA CACAGACATC ATAACAAAAA ATTTCCACCA AACCCCCCCT 600 

CTCCCCCGCT TCTGGCCACA GCACTTAAAC ACATCTCTGC CAAACCCC 648 



( 2 ) INFORMATION FOR SEQ ED NO:334: 




5,837,832 

161 162 

-continued 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( 0 ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:334: 



( 2 ) INFORMATION FOR SEQ ID NO:335: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:335: 



( 2 ) INFORMATION FOR SEQ ID NO:336: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) 1YPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 3 36: 



( 2 ) INFORMATION FOR SEQ ID NO:337: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( II ) TYPE: nucleic acid 
( C ) STRANDEDNESS; single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

(iti) SEQUENCE DESCRIPTION: SEQ ID NO:337: 



( 2 ) INFORMATION FOR SEQ ID NO:33& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:338: 



( 2 ) INFORMATION FOR SEQ ID NO:339: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: noclcic acid 
{ C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 
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( i i ) MOLECULE TYPE: DNA (probe) 
( it i ) SEQUENCE DESCRIPTION: SEQ ID NO:339: 
ACGACTCCTC CC 



1 2 



( 2 ) INFORMATION FOR SEQ ID NO:340: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:340: 

TACGACTCCT CC 

( 2 ) INFORMATION FOR SEQ ID NO:341: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPHON: SEQ ID NO:341: 

CTACGACTCC TC 

( 2 ) INFORMATION FOR SEQ ID NO:342: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nocleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:342: 

TCTACGACTC CT 

( 2 ) INFORMATION FOR SEQ ID NO:343: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:343: 

TTCTACGACT CC 



( 2 ) INFORMATION FOR SEQ ID NO:344: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:344: 
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ATTCTACGAC TC 

( 2 ) INFORMATION FOR SEQ W NO:345: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:345: 

TATTCTACGA CT 



( 2 ) INFORMATION FOR SEQ ID NO:346: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:346: 



( 2 ) INFORMATION FOR SEQ ID NO:347: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 12 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:347: 



( 2 ) INFORMATION FOR SEQ ID NO:348: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:348: 



( 2 ) INFORMATION FOR SEQ ID NO:349: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( 1 I ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:349: 



( 2 ) INFORMATION FOR SEQ ID NO:350: 
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( i i ) MOLECULE TYPE; DNA (probe) 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:355: 
CT ACGACTCC 



( 2 ) INFORMATION FOR SEQ ID NO:356: 

( I ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic odd 
( C ) STRAND EDNESS : single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE; DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:356: 

T C TACGACTC 



( 2 ) INFORMATION FOR SEQ ID NO:357: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:3S7: 

TTCTACGACT 



( 2 ) INFORMATION FOR SEQ ID NO:35& 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucteic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( j i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:358: 

ATT C TA CG AC 



( 2 ) INFORMATION FOR SEQ ID N0.359: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 10 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (probe) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:359: 

T AT T CT ACGA 



( 2 ) INFORMATION FOR SEQ ID NO:360t 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 184 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (oligonucleotide) 



( x 1 ) SEQUENCE DESCRIPTION: SEQ (D NO:360: 
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TACTCCCCTG 


CCCT CAACAA 


GATGTTTTGC 


C AACTGGCCA 


AGACCTGCCC 


TGTGCAGCWG 


6 0 


KGC GWWG ATT 


CCACACCCCC 


GCCCGGCACC 


CGCGTCCGCG 


CC AT GG C CAT 


CT ACAAGCAG 


1 2 0 


TCACAGCACA 


TGACGGAGOW 


WGKGAGGCGC 


TGC C CCCACC 


ATGAGCGC YG 


C YCAGAT AGC 


1 8 0 


S AYC 
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We claim: 

1. An array of oligonucleotide probes immobilized on a 
solid support, said array having at least 100 probes and no 
more than 100,000 different oligonucleotide probes 9 to 20 
nucleotides in length occupying separate known sites in said 
array, said oligonucleotide probes comprising at least four 
sets of probes: (1) a first set that is exactly complementary 
to a reference sequence and comprises probes that com- 
pletely span the reference sequence and, relative to the 
reference sequence, overlap one another in sequence; and (2) 
three additional sets of probes, each of which is identical to 
said first set of probes but for at least one different 
nucleotide, which different nucleotide is located in the same 
position in each of the three additional sets but which is a 
different nucleotide in each set. 

2. The array of claim 1, further comprising a fourth 
additional set of probes, which fourth additional set is 
identical to probes in the first set. 

3. The array of claim 1, wherein said reference sequence 
is a double-stranded nucleic acid and probes complementary 
to both strands of said reference are in said array. 

4. The array of claim 1, wherein said probes are 12 to 17 
nucleotides in length. 

5. The array of claim 4, wherein said probes are 15 
nucleotides in length and attached by a covalent linkage to 
a site on a 3'-end of said probes, and said different nucleotide 
is located at position 7, relative to the 3' -end of said probes. 

6. The array of claim 1, wherein said reference sequence 
is exon 10 of a CFTR gene, and said array has between 1000 
and 100,000 oligonucleotide probes 10 to 18 nucleotides in 
length. 

7. The array of claim 6, wherein said array comprises a set 
of probes comprising a specific nucleotide sequence selected 
from the group of sequences consisting of: 
3 , -TTTATAXTAG (SEQ ID. NO:302); 
3'-TTATAGXAGA (SEQ ID. NO:303); 
3'-TATAGTXGAA (SEQ ID. NO:304); 
3'-ATAGTAXAAA (SEQ ID. NO:305); 

3' TAGTAGXAAC (SEQ ID. NO:306); 
3'-AGTAGAXACC (SOQ ID. NO:307); 
3'-GTAGAAXCCA (SEQ ID. NO:308); 
3'-TAGAAAXCAC (SEQ ID. NO:309); and 
3'-AGAAACXACA (SEQ ID. NO:310); wherein each set 

comprises 4 probes, and X is individually A, G, C, and T 

for each set. 

8. The array of claim 6, wherein said group of sequences 
consists of: 

3'-TTTATAXTAGAAACC (SEQ ID. NO:9); 
3'-TTATAGXAGAAACCA (SEQ ID. NO:10); 
3'-TATAGTXGAAACCAC (SEQ ID. NO:ll); 
3'-ATAGTAXAAACCACA (SEQ ID. NO:12); 



3-TAGTAGXAACCACAA (SEQ ID. NO:13); 
3'-AGTAGAXACCACAAA (SEQ ID. No:14); 
3'-GTAGAAXCCACAAAG (SEQ ID. NO: 15); 
3'-TAGAAAXCACAAAGG (SEQ ID. NO: 16); and 
15 3'-AGAAACXACAAAGGA (SEQ ID. NO: 17); wherein 

each set comprises 4 probes, and X is individually A, G, 

C, and T for each set. 

9. The array of claim 1, wherein said reference sequence 
is a sequence of a D-loop region of human mitochondrial 

20 DNA. 

10. The array of claim 9, wherein said probes are 15 
nucleotides in length, and said array comprises a first set of 
probes exactly complementary to a sequence contained in a 
sequence bounded by positions 16280 to 356 of the refer- 

25 ence sequence and four additional sets of probes identical to 
said first set but for position 7, relative to a 3'-end of a probe, 
which 3'-end is covalently attached to the substrate, where, 
for each of the four additional probe sets, a different nucle- 

30 otide is located, such that, for each probe in said first set, 
there is an identical probe in one of the four additional sets, 
and such that the array has between 2500 and 100,000 
oligonucleotide probes. 

11. The array of claim 1, wherein said reference sequence 
35 is a sequence from an exon of a human p53 gene. 

12. The array of claim 11, wherein said reference 
sequence comprises at least a 60 nucleotide contiguous 
sequence from exon 6 of a p53 gene. 

13. The array of claim 11, wherein said reference 
40 sequence is exon 5 of a p53 gene, said probes are 17 

nucleotides long, and said array comprises a first set of 
probes exactly complementary to said sequence and at least 
three additional sets of probes, each set comprising probes 
identical to said first set but for a nucleotide at position 7, 
45 relative to a 3'-end of a probe, which 3'-end is covalently 
attached to the substrate, which nucleotide is different from 
a nucleotide at this position in a corresponding probe of said 
first set. 

14. The array of claim 1, wherein said probes arc oli- 
50 godeoxy ribonucleotides. 

15. The array of claim 1, wherein said array has between 
10,000 and 100,000 probes. 

16. The array of claim 1, wherein the reference sequence 
is from a human immunodeficiency virus. 

55 17. Irie array of claim 16, wherein the reference sequence 
is from a reverse transcriptase gene of the human immuno- 
deficiency virus. 

18. The array of claim 1, wherein said probes are immo- 
bilized to said solid support via a linker. 

* * # * * 



